MOLECULAR CLONES WITH MUTATED
HIV GAG/POL, SIV GAG AND SIV ENV GENES

I. TECHNICAL FIELD 

The invention relates to nucleic acids comprising mutated HIV-1 
gag/pol and SIV gag gene sequences which are capable of being expressed 
independently of any SIV or HIV regulatory factors. The invention also relates to 
nucleic acids comprising a mutated SIV env gene sequence, which is capable of 
being expressed independently of any SIV or HIV regulatory factors. The preferred 
nucleic acids of the invention are capable of producing infectious viral particles. 

The invention also relates to vectors, vector systems and host cells 
comprising the mutated HIV-1 gag, HIV-1 pol, SIV gag and/or SIV env gene 
sequences. The invention also relates to host cells comprising these nucleic acids
and/or vectors or vector systems. The invention also relates to the use of these
nucleic acids, vectors, vector systems and/or host cells in gene therapy or as
vaccines. 

II. BACKGROUND 

Until recently, gene therapy protocols have often relied on vectors 
derived from retroviruses, such as murine leukemia virus (MLV). These vectors are 
useful because the genes they transduce are integrated into the genome of the target 
cells, a desirable feature for long-term expression. However, these retroviral vectors 
can only transduce dividing cells, which limits their use for in vivo gene transfer in 
nonproliferating cells, such as hepatocytes, myofibers, hematopoietic stem cells, and 
neurons. 

Lentiviruses are a type of retrovirus that can infect both dividing and 
nondividing cells. They have proven extremely efficient at providing long-term 
gene expression (for up to 6 months) in a variety of nondividing cells (such as
neurons and macrophages) in animal models. See, e.g., Amado et al., Science 
285:674-676 (July 1999). It has been proposed that the optimal gene transfer 
system would include a vector based on HIV, or other lentivirus, that can integrate
into the genome of nonproliferating cells. Because retroviruses integrate in the 
genome of the target cells, repeated transduction is unnecessary. Therefore, in 
contrast to an adenoviral vector capable of in vivo gene delivery, problems linked to 
the humoral response to injected viral antigens can be avoided. See, e.g., Naldini et 
al., Science, 272:263-267 (1996), p. 263.

HIV and other lentiviruses have a complex genome that, in addition 
to the essential structural genes (env, gag, and pol), contains regulatory (tat and rev) 
and accessory genes (vpr, vif, vpu, and nef). HIV has evolved to efficiently infect 
and express its genes in human cells, and is able to infect nondividing cells such as 
macrophages because its preintegration complex can traverse the intact membrane 
of the nucleus in the target cell. This complex contains, in addition to the viral 
DNA, the enzyme integrase, the product of the vpr gene, and a protein encoded by 
the gag gene called matrix. The matrix protein enables the preintegration complex 
to pass into the nucleus to access the host DNA. Lentiviruses cannot efficiently 
transduce truly quiescent cells (cells in the G0 state). However, unlike murine
retroviral vectors, in addition to being able to infect dividing cells, HIV-based
vectors can achieve effective and sustained transduction and expression of 
therapeutic genes in nondividing cells, such as hematopoietic stem cells and in 
terminally differentiated cells such as neurons, retinal photoreceptors, muscle, and 
liver cells. See, e.g., Amado et al. (July 1999) and Klimatcheva et al., Frontiers in
Bioscience 4:d481-496 (June 1999), and the references cited therein.

Although lentiviral vectors can be efficient gene delivery vehicles, 
there are safety concerns due to their origin. Therefore, the field has turned its 
attention to the development of vectors and production systems with built-in safety 
features to prevent the emergence of replication competent lentivirus (RCL). For 
example, in most laboratory applications, lentiviral vectors are generally created in a 
transient system in which a cell line is transfected with three separate constructs: a 
packaging construct, a transfer construct, and an envelope encoding construct. The 
packaging construct contains the elements necessary for vector packaging (except 
for env) and the enzymes required to generate vector particles. The transfer 
construct contains genetic cis-acting sequences necessary for the vector to infect the 
target cell and for transfer of the therapeutic (or reporter) gene. The lentivirus env
gene is generally deleted from the packaging construct and instead the envelope
gene of a different virus is supplied in a third vector (the "env-coding vector"),
although the lentivirus env gene may be used if the vector is
intended to infect CD4+ T cells. A commonly used envelope gene is that encoding
the G glycoprotein of the vesicular stomatitis virus (VSV-G), which can infect a
wide variety of cells and in addition confers stability to the particle and permits the
vector to be concentrated to high titers (see, e.g., Naldini et al., Science 272:263-267
(1996) and Akkina et al., J. Virol. 70:2581 (1996)). The use of three separate
constructs and the absence of overlapping sequences between them minimizes the 
possibility of recombination during lentivirus (transfer) vector production. In 
addition, because no viral proteins are expressed by the lentiviral (transfer) vector 
itself, such vectors do not trigger an effective immune response against cells expressing
vector in animal models (a particular problem with vectors based on adenovirus). 
See, e.g., Amado et al., Science 285:674-676 (July 1999) and the references cited 
therein. See also Naldini et al. Science 272:263-267 (1996). 
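
As a rough illustration of the "absence of overlapping sequences" criterion discussed above, the following Python sketch reports which pairs of constructs share any run of k identical nucleotides. The function names, the window length k, and the toy sequences are assumptions introduced here for illustration only; they are not taken from the cited references. The check is deliberately simplistic: it ignores reverse-complement and near-identical matches, either of which could also support recombination.

```python
from __future__ import annotations

from itertools import combinations


def kmers(seq: str, k: int) -> set[str]:
    """Return the set of all length-k substrings of seq (upper-cased)."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def overlapping_pairs(constructs: dict[str, str], k: int = 20) -> list[tuple[str, str]]:
    """Return the name pairs of constructs that share at least one k-mer."""
    index = {name: kmers(seq, k) for name, seq in constructs.items()}
    return [(a, b) for a, b in combinations(index, 2) if index[a] & index[b]]


# Toy sequences only (hypothetical, far shorter than real plasmids).
toy = {
    "packaging": "AAAAAAAACCCCCCCC",
    "transfer": "GGGGGGGGTTTTTTTT",
    "envelope": "TTTTCCCCCCCCAAAA",
}
print(overlapping_pairs(toy, k=8))
```

With these toy inputs, only the "packaging" and "envelope" entries share an 8-nucleotide run, so only that pair is reported. A practical screen would more likely rely on an alignment tool than on exact k-mer matching.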

The initial packaging plasmids contained most HIV genes except for 
env. In an effort to improve safety, subsequent HIV vectors have been produced in 
which the packaging plasmid is devoid of all accessory genes. This process does 
not interfere with efficient vector production and significantly increases the safety 
of the system because potential RCLs lack the accessory genes necessary for 
efficient replication of HIV in humans. Although these vectors can transduce 
growth-arrested cell lines and neurons in vivo, they have been reported to not 
efficiently transduce macrophages. The accessory gene vpr is believed to be 
necessary for HIV infection of these cells using these HIV vectors. See, Zufferey et 
al., Nature Biotechnol. 15:871-875 (1997). In contrast, as discussed later herein, 
the HIV-based lentiviral vectors of the present invention do not need any HIV 
accessory genes in order to be able to infect human macrophages and the other cells 
tested. 

The requirement of vpr or vif for efficient transduction of liver cells
has also been reported. See, e.g., Kafri et al., Nature Genet. 17:314 (1997). These 
results indicate that the requirement of accessory genes for efficient
lentivirus-mediated gene transfer is dependent on the type of cell chosen as target, suggesting
that future applications of lentiviral vectors may involve vector constructs with 
different accessory genes, as needed. 

Zufferey et al., (1997) describe an HIV vector system in which the 
virulence genes env, vif, vpr, vpu, and nef have been deleted. This multiply
attenuated vector conserved the ability to transduce growth-arrested cells and
monocyte-derived macrophages in culture, and could efficiently deliver genes in
vivo into adult neurons. The packaging plasmids described by Zufferey et al. (1997)
and Naldini et al. (1996) encode Rev and Tat, in addition to Gag and Pol. 

Lentiviral vectors engineered to become packaged into virions in the 
absence of the regulatory gene tat have also been described. See, e.g., Kim et al., J. 
Virol. 72:811-816 (1998) and Miyoshi et al., J. Virol. 72:8150-8157 (1998). In
these vectors the tat gene has been removed from the packaging plasmid. Kim et al.
state that tat is not necessary as long as the 5' LTR promoter is replaced with
a strong constitutive promoter. This replacement also has other advantages for HIV therapy.
Replacement of the HIV-1 LTR with a constitutive HCMV promoter permits the use
of anti-Tat molecules such as Tat transdominant mutants or Tat activation response
element decoys as therapeutic agents, since they will not affect vector production
(see p. 814, col. 2). The removal of the tat gene eliminates an essential virulence
factor that could contribute to a possible RCL. Kim et al. (1998) describe a vector 
system which does not contain tat, vif, vpr, vpu and nef. The preferred vector 
system includes the rev gene which, the authors state "with RRE, is required for 
efficient RNA handling in this system." (p. 811, col. 2). However, Kim et al. also 
constructed Rev independent constructs using CTE. Kim et al. state that the 
rev/RRE components could be removed by using a sequence such as the Mason- 
Pfizer monkey virus (MPMV) constitutive transport element (CTE), thereby 
eliminating all accessory proteins, but this leads to a significant reduction in titer. 

Srinivasakumar et al., J. Virol. 71:5841-5848 (1997) describes the 
generation of stable HIV-1 packaging lines that constitutively express high levels of 
HIV-1 structural proteins in either a Rev-dependent or a Rev-independent fashion. 
These cell lines were used to assess gene transfer by using a HIV-1 vector 
expressing the hygromycin B resistance gene and to study the effects of Rev, Tat, 
and Nef on the vector titer. The Rev-independent cell lines were created by using
gag-pol and env expression vectors that contain the MPMV CTE. This article 
describes the construction of four plasmids, among others: pCMVgagpol-RRE and
pCMVenv, which require Rev coexpression for HIV-1 structural gene expression,
and pCMVgagpol-CTE and pCMVenv-CTE, which do not. To create
Rev-containing and Rev-independent packaging cell lines, CMT3 cells were transfected
with vectors expressing Gag, Gag-Pol, and Env, using a calcium phosphate 
transfection procedure. 

By creating an HIV vector which contained the MPMV CTE 
(pTR167-CTE) and a packaging cell line which expressed the HIV structural 
proteins in a Rev-independent fashion, the authors were able to obtain an HIV vector
system that functions completely without Rev. The titer of the vector obtained from 
this system was essentially the same as that obtained from a parallel system which 
contained Rev. The authors state that, in this context, the CTE seemed to substitute 
completely for Rev-RRE functions, similar to what was previously observed in 
transient-expression assays with Rev-dependent constructs. This is in contrast to 
situations where several rounds of HIV replication were measured. In those cases, 
titers from CTE-containing viruses were always reduced by at least 1 log unit 
compared to viruses utilizing Rev and the RRE. (See, Srinivasakumar et al., p.
5847). 

The authors state that having an HIV vector system
that works in the absence of Rev opens the possibility of using it as a delivery
vehicle for intracellular immunization against Rev function. Genes encoding Rev 
antagonists that have dramatic inhibitory effects on HIV replication, such as Rev 
M10 or RRE decoys, could be introduced into an HIV vector and put into cells 
normally infectable by HIV. Expression of the "anti-Rev" gene would be expected 
to dampen HIV infection. Any residual HIV replication should lead to activation of 
the vector LTR (by Tat) and create a vector-derived RNA that would be packaged 
by proteins derived from the infectious virus. In this scenario, the wild-type virus 
would act as a helper that may allow the spread of vector particles to previously 
nonimmunized cells. Because of the additional vector spread, it is likely that this 
type of scheme will be more effective in modulating HIV infection in vivo than one 
based on traditional retrovirus vectors. The authors state that they are currently
testing this approach in model systems. (See, Srinivasakumar et al., p. 5847). 

Another development in the quest for a safe system is the so-called 
self-inactivating (SIN) vector. See, e.g., Yu et al., Proc Natl Acad Sci USA
83:3194-8 (1986) and Miyoshi et al., J. Virol. 72:8150 (1998). In Yu et al., a
retrovirus-derived SIN vector was designed for the transduction of whole
genes into mammalian cells. The SIN vector of Yu et al. contains a deletion of 299 
base pairs in the 3' long terminal repeat (LTR), which includes sequences encoding 
the enhancer and promoter functions. When viruses derived from such vectors were 
used to infect NIH 3T3 cells, the deletion was transferred to the 5' LTR, resulting in 
the transcriptional inactivation of the provirus in the infected cell. Introduction of a
hybrid gene (human metallothionein-promoted c-fos) into cells via a SIN vector was 
not associated with rearrangements and led to the formation of an authentic mRNA 
transcript, which in some cases was induced by cadmium. The vector described in 
Miyoshi et al. also contains a deletion in the 3' (downstream) LTR. A sequence within
the upstream LTR serves as a promoter under which the viral genome is expressed. 
The deletion introduced in the downstream LTR is transferred to the upstream LTR 
during reverse transcription. This deletion inactivates the LTR promoter and 
eliminates the production of vector RNA. The gene (or genes) to be transferred 
(e.g., a reporter or therapeutic gene) is expressed from an exogenous viral or cellular 
promoter that is inserted into the lentivirus vector. An important safety feature of 
SIN vectors is that inactivation of the promoter activity of the LTR reduces the 
possibility of insertional mutagenesis (of the transfer vector) into the host genome. 
In addition, because the expression of the (transfer) vector RNA is eliminated, the 
potential for RCL production in the target cell is further minimized. SIN vectors 
should be particularly useful in gene transfer experiments designed to study the 
regulated expression of genes in mammalian cells. Absence of enhancer and 
promoter sequences in both LTRs of the integrated provirus should also minimize
the possibility of activating cellular oncogenes and may provide a safer alternative 
to be used in human gene therapy. Other modifications to enhance safety and 
specificity include the use of specific internal promoters that regulate gene 
expression, either temporally or with tissue or cell specificity. 
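
The transfer of the downstream LTR deletion to the upstream LTR, as described above, can be pictured with a small Python model. This is a minimal sketch under stated assumptions: the class and function names are hypothetical, the LTR regions are represented as plain labels rather than real sequences, and the only biology encoded is the well-known fact that the genomic RNA carries a single U3 copy (from the downstream LTR), from which both proviral LTRs are rebuilt during reverse transcription.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GenomicRNA:
    """Vector RNA, 5' to 3': R - U5 - internal - U3 - R."""
    r: str
    u5: str
    internal: str
    u3: str


@dataclass(frozen=True)
class VectorDNA:
    """Plasmid or proviral DNA with two LTRs, each U3 - R - U5."""
    u3_upstream: str
    u3_downstream: str
    r: str
    u5: str
    internal: str


def transcribe(dna: VectorDNA) -> GenomicRNA:
    # Transcription starts at R of the upstream LTR and ends at R of the
    # downstream LTR, so only the downstream U3 is present in the RNA.
    return GenomicRNA(r=dna.r, u5=dna.u5, internal=dna.internal, u3=dna.u3_downstream)


def reverse_transcribe(rna: GenomicRNA) -> VectorDNA:
    # Both proviral LTRs are rebuilt from the single U3 copy in the RNA.
    return VectorDNA(u3_upstream=rna.u3, u3_downstream=rna.u3,
                     r=rna.r, u5=rna.u5, internal=rna.internal)


# Hypothetical SIN plasmid: intact upstream U3, deleted downstream U3.
plasmid = VectorDNA(
    u3_upstream="ENHANCER+PROMOTER",
    u3_downstream="",
    r="R",
    u5="U5",
    internal="internal-promoter+transgene",
)
provirus = reverse_transcribe(transcribe(plasmid))
print(provirus.u3_upstream == "")
```

In this model the upstream U3 of the plasmid drives production of the vector RNA in the producer cell, but the integrated provirus ends up with the deleted U3 in both LTRs, which is why the transgene must be expressed from an internal promoter.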

Other strategies to improve safety in human studies would be to use
nonhuman lentiviruses such as simian immunodeficiency virus, bovine
immunodeficiency virus, or equine infectious anemia virus. Among nonhuman lentiviruses, vectors
derived from the feline immunodeficiency virus have been engineered to efficiently
transduce nondividing human cells. See, e.g., Poeschla et al., Nature Med. 4:354-
357 (1998) and WO 99/15641. In addition, White et al., J. Virol. 73:2832-2840
(April 1999) described lentiviral vectors using human and simian immunodeficiency
virus elements in an attempt to improve safety by reducing the likelihood of
recombination between packaging constructs and transfer constructs.

The development of efficient packaging lines has proven challenging 
because expression of the VSV-G envelope and a number of HIV proteins is toxic to
cells. Recently, a producer line has been designed in which the expression of
packaging genes and VSV-G, and therefore the production of vector, can be turned
on at will. Kafri et al., J. Virol. 73:576-584 (1999). The cell line can be expanded
for scale-up vector production when the expression of toxic genes is turned off. 
This cell line produces high titer vector without generating RCL. Hematopoietic 
stem cells transduced with an HIV vector were transplanted into rhesus macaques as 
described by Donahue et al. Blood 92 (suppl. 1), abstract 4648.5 (1998) with at least 
a 14-month follow-up. At that time the procedure proved to be safe; all animals in 
the study have remained healthy without evidence of circulating HIV or vector. 
See, Amado et al., Science 285:674-676 (July 1999). 

Many gene therapy protocols have been designed to correct a number 
of inherited metabolic, infectious, or malignant diseases using the hematopoietic 
stem cell. This cell has the capacity to self-renew and to differentiate into all of the 
mature cells of the blood and immune systems. Many diseases that affect these 
systems could potentially be treated by the stable introduction of therapeutic genes 
into stem cells. Recently, lentiviral vectors were shown to bypass the need for ex
vivo stem cell stimulation (which is necessary when using murine retroviral
vectors), by mediating efficient gene transfer into very primitive human stem cells
that contributed to stable, long-term reconstitution of SCID mouse bone marrow
with many hematopoietic lineages. See, e.g., Miyoshi et al., Science 283:682 
(1999). Similarly, in a rhesus macaque model of autologous transplantation with 
lentivirus-transduced stem cells, multilineage gene expression was found,
suggesting transduction of an early blood cell progenitor under conditions of 
minimal stem cell stimulation, ordinarily insufficient for transduction with murine 
retroviruses. See, Donahue et al., Blood 92 (suppl. 1), abstract 4648.5 (1998) and
Amado et al., Science 285:674-676 (July 1999). 

In HIV infection, another advantage of lentiviral vectors designed 
against HIV is their potential to be mobilized by HIV in the infected patient, 
because the virus supplies all of the necessary elements for packaging of the vector. 
If these mobilized vectors contained the HIV envelope, they could efficiently 
transfer their genes (for example, genes custom-designed to confer resistance 
against HIV) into CD4+ T cells, protecting them from subsequent HIV infection.
Lentiviral vectors can also be designed to efficiently express their genes only in
CD4+ T cells that are infected with HIV (so-called tat-inducible vectors). In these
vectors, all HIV genes, including tat and rev, are ablated; cis-acting sequences 
required for integration, expression, and packaging are retained, and expression is 
dependent on the activity of the HIV LTR (which requires transactivation by Tat). 
It has been shown that in this system, vector expression is induced efficiently upon 
HIV infection. Moreover, in the absence of genes that confer resistance against 
HIV, stable integration of this vector in permissive cell lines resulted in inhibition of 
HIV replication. Although the mechanism of HIV inhibition has not been 
completely elucidated, preliminary results suggest that this vector competes with 
HIV at the level of reverse transcription. See, An et al., J. Virol., in press, and
Amado et al., Science 285:674-676 (1999). 

A number of other potential medical applications, where the 
modification of the genetic material of quiescent cells could result in the prevention 
or reversal of a disease process, are beginning to be explored. For example, the 
finding that lentiviral vectors can mediate stable and long-term gene transfer by 
direct injection of vector into the rat and mouse retina has lent support to the notion 
of gene therapy for the treatment of retinitis pigmentosa. This degenerative disease 
of the retina is characterized by photoreceptor cell death, resulting in a slow 
progression to blindness. Mutations in the cGMP phosphodiesterase β subunit
(PDEβ) gene of rod photoreceptors lead to an autosomal recessive form of retinitis
pigmentosa in humans, and in the rd mouse model of the disease. Previous studies
have shown that adenovirus and adeno-associated virus-mediated PDEβ subretinal
gene transfer results in a delay in photoreceptor cell death. Using the rd mouse
model, a recent study demonstrated that photoreceptors could be rescued in up to
50% of eyes injected with a lentivirus vector containing the murine PDEβ gene. In
contrast with the short-term expression previously obtained with adenovirus vectors,
PDEβ expression in this study persisted for at least 24 weeks. This finding points to
the potential success of gene therapy in a disease that currently lacks effective 
treatment. See, Takahashi et al., J. Virol., 73:7812-7816 (Sept. 1999) and Amado et 
al. Science, 285:674-676 (1999). 

In nature, the expression of gag, pol, and env of HIV-1 depends on 
the presence of the viral Rev protein. This dependence is, at least in part, due to the 
presence of negatively acting sequences (inhibitory or instability elements [INS]) 
located within unspliced and partially spliced mRNAs. The positive interaction of 
Rev with the Rev-responsive element [RRE] in these mRNAs counteracts the 
negative effects of the inhibitory sequences. 

None of the above references teach or suggest that the gag and/or pol 
genes described therein may be replaced with the gag and/or pol genes in which the 
inhibitory/instability sequences have been mutated to render their expression Rev-independent.
Furthermore, there is no disclosure of the specific HIV-1 gag/pol or SIV gag
mutated genes described herein.

The gag/pol clone of the invention was made using the method for 
eliminating inhibitory/instability regions from a gene as first described in U.S. 
patent application Serial No. 07/858,747, filed March 27, 1992 (which issued as US 
Patent No. 6,174,666) entitled "Method of Eliminating Inhibitory/Instability 
Regions from mRNA" and later described in a Continuation-in-Part ("CIP") 
application, filed as PCT application PCT/US93/02908 on March 29, 1993 and U.S. 
Patent Nos. 5,972,596 and 5,965,726. The disclosure of the CIP application was 
published as International Publication No. WO 93/20212 on October 14, 1993. 
(The disclosures of these patents and patent applications are specifically 
incorporated by reference herein in their entirety.) The method was also described 
in Schwartz et al., J. Virol. 66:7176-7182 (1992).

Schneider et al., J. Virol. 71:4892-4903 (1997), extend the work
described in the patent applications and in Schwartz et al. by identifying and 
characterizing additional INS within gag, protease and pol genes and mutating them 
in a similar manner. Schneider et al. disclose nucleic acid constructs which contain 
completely mutated HIV-1 gag genes, but only partially mutated HIV-1 pol genes. 

Schneider et al. demonstrate that expression vectors containing an 
intact or nearly intact p55gag region allow the production of immature viral particles
in mammalian cells in the absence of any other HIV proteins. The introduction of
additional mutations in the protease region allowed efficient production of
Gag/protease, which resulted in processing of the Pr55gag precursor and production
of mature Gag particles with a lentivirus-like conical-core structure. 

Schneider et al. disclose that Rev-independent expression vectors 
allow the efficient expression of Gag proteins in many cell lines that are not able to 
support efficient Rev-RRE-dependent rescue of these RNAs. Schneider et al. also 
disclose that gag/pol expression vectors may be important for vaccination 
approaches against HIV-1, since the gag/pol region is more conserved than is the 
env region and may be important for an effective immune response against HIV and 
for protection against infection. They also state that efficient HIV gene expression 
in many cells is also of interest for possible gene transfer experiments using 
lentiviral vectors in nondividing or slowly dividing cells, since HIV and the other 
lentiviruses are able to infect quiescent cells. 

Pavlakis et al., Natl Conf Hum Retroviruses Relat Infect (2nd). 
(1995), 91, state that Rev-independent Gag expression vectors were able to produce 
viral particles in human and mouse cells in the absence of any other HIV proteins,
and that additional mutations in the pol region allowed the expression of the
protease and the processing of the p55 gag precursor. Direct DNA injection of Tat-
and Rev-independent Gag expression vectors in mouse muscle resulted in Gag
expression detected by ELISA and in anti-gag antibody response. Several Rev- and
Tat-independent Gag expression cassettes were inserted into retroviral vectors and
cell lines expressing Gag or Gag fragments that are dominant negative inhibitors of 
HIV-1 were constructed. 

Shiver et al. (1996) describe the results of DNA vaccination of mice
and non-human primates with plasmid DNA encoding mutated genes
for either HIV-1 gag (p55 gag) or env (gp120 or gp160). Both gag and env vaccine
recipients exhibited antigen-specific cytotoxic and helper T lymphocyte (CTL, Th) 
responses. The results are stated to demonstrate that DNA vaccines elicited long- 
lived T cell responses in both mice and nonhuman primates that were disseminated 
throughout the lymphatics. 

III. SUMMARY OF THE INVENTION

The invention relates to nucleic acids comprising the nucleic acid 
sequence of the mutated HIV-1 gag/pol gene shown in Figure 1 (SEQUENCE ID 
NO:1) and vectors and vector systems comprising these nucleic acids.

The invention also relates to nucleic acids comprising the nucleic 
acid sequence of the mutated SIV gag gene shown in Figure 3 and vectors and 
vector systems comprising these nucleic acids. 

The invention also relates to nucleic acids comprising the mutated 
SIV env gene shown in Figure 17 and vectors and vector systems comprising these 
nucleic acids. 

The invention also relates to products produced by the nucleic acids, 
e.g., mRNA, protein, and infectious viral particles. 

The invention also relates to compositions comprising these nucleic 
acids and/or their expression products. 

The invention also relates to host cells comprising these nucleic 
acids, vector systems or viral particles. 

The invention also relates to uses of these nucleic acids, vector 
systems, host cells, expression products, and/or compositions to produce mRNA, 
proteins, and/or infectious viral particles, and/or to induce antibodies and/or 
cytotoxic or helper T lymphocytes. 

The invention also relates to the use of these nucleic acid constructs, 
vectors, vector systems and/or host cells for use in immunotherapy and
immunoprophylaxis, e.g., as a vaccine, or in genetic therapy after expression, 
preferably in humans. The nucleic acid constructs of the invention can include or be 
incorporated into lentiviral vectors or other expression vectors or they may also be
directly injected into tissue cells resulting in efficient expression of the encoded
protein or protein fragment. These constructs may also be used for in vivo or
in vitro gene replacement, e.g., by homologous recombination with a target gene
in situ.

IV. BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1. DNA sequence of a mutated HIV-1 gag/pol molecular clone 
(SEQUENCE ID NO:1). The gagpol terminator is located at positions 4305-4307
of SEQUENCE ID NO:1.

Fig. 2. Comparison of the sequence of the wild-type and mutated
pol region in pCMVgagpolBNkan. Position #1 in the figure is position 2641 in
plasmid pCMVgagpolBNkan. The comparison starts at position 1872 from the gag
initiator ATG.

Fig. 3. DNA sequence of a mutated SIV gag molecular clone
(SIVgagDX).

Fig. 4. Comparison of the mutated SIV gag DNA sequence in 
SIVgagDX with the wild type SIV sequence from Simian (macaque) 
immunodeficiency virus isolate 239, clone lambda siv 239-1 (GenBank accession 
No. M33262).

Fig. 5. Schematic diagram of some components of sample versions 
of a lentiviral system. BGH poly (A): bovine growth hormone poly (A) signal; 
MSD: mutated splice donor site; ψ: encapsidation signal; SD: splice donor site;
SA: splice acceptor site; "X" indicates that the ATG codon of the partial gag gene
sequence is mutated so that translation of this gene does not occur. 

Fig. 6. Schematic diagram of the packaging construct 
pCMVgagpolBNkan. 

Fig. 7. Schematic diagram of transfer construct 1: pmBCwCNluci. 
The packaging signal, the CMV promoter and the coding region for the luciferase 
gene are flanked by the 5' and 3' HIV-1 LTRs, which provide promoter and
polyadenylation signals, as indicated by the arrows. Three consecutive arrows 
indicate the U5, R, and U3 regions of the LTR, respectively. The transcribed 
portions of the LTRs are shown in black. Some restriction endonuclease cleavage
sites are also indicated. 

Fig. 8. Schematic diagram of transfer construct 2: pmBCmCNluci.
Symbols are as above. 

Fig. 9. DNA sequence of packaging construct pCMVgagpolBNkan. 

Fig. 10. DNA sequence of transfer construct 1: pmBCwCNluci. 

Fig. 11. DNA sequence of transfer construct 2: pmBCmCNluci.

Fig. 12. Nucleotide sequence of the region BssHII (711) to ClaI
(830) in wild-type HIV-1 molecular clones HXB2 and NL4-3, and in the transfer 
constructs. The translation initiator signal for Gag protein (ATG) is underlined. 
pmBCwCNluci and pmBCmCNluci (transfer constructs 1 and 2) contain the 
sequence mBCwCN. Transfer construct 3 contains the sequence m2BCwCN. In 
contrast to the sequence mBCwCN, m2BCwCN has different mutations at the 5' 
splice site region and has an intact Gag ATG. 

Fig. 13. Bar graph showing levels of gag protein that is released 
from cells upon transient transfection with pCMVgagpolBNkan (labeled 
pCMVBNKan in the figure). 

Fig. 14. Bar graph showing reverse transcriptase activity from the 
Rev-independent gag-pol HIV-1 vector pCMVgagpolBNkan (labeled pCMVBNKan 
in the figure). 

Fig. 15. Bar graphs showing the amount of luciferase per nanogram
of p24 Gag protein detected in cells transduced with pCMVgagpolBNkan
Rev-independent gag-HIV-1 based retroviral vectors. The results show that
pCMVgagpolBNkan Rev-independent gag-HIV-1 based retroviral vectors display
high transduction efficiency in (A) 293 cells, (B) human lymphoid cells, (C) human 
myeloid cells (U937), as well as (D) non-dividing cells such as primary human 
macrophages. 

Fig. 16. Schematic diagram of the SIV envelope encoding vector 
CMVkan/R-R-SIVgp160CTE.

Fig. 17. DNA sequence of the SIV envelope encoding vector 
CMVkan/R-R-SIVgp160CTE containing a mutated SIV env gene.

V. MODES FOR CARRYING OUT THE INVENTION

It is to be understood that both the foregoing general description and 
the following detailed description are exemplary and explanatory only, and are not 
restrictive of the invention, as claimed. The accompanying drawings, which are 
incorporated in and constitute a part of the specification, illustrate an embodiment of 
the invention and, together with the description, serve to explain the principles of 
the invention. 

One aspect of the invention comprises vectors that encode the Gag 
and/or Pol of HIV-1 in a Rev-independent manner. An example of such a vector
which is described herein is the plasmid pCMVgagpolBNkan, which encodes the 
complete Gag and Pol of HIV-1 in a Rev-independent manner, and also contains a 
gene conferring kanamycin resistance. This plasmid is Tat and Rev-independent 
and was generated by eliminating the inhibitory/instability sequences present in the 
gag/pol mRNA without altering the amino acid sequence of the proteins coded by 
the genes. 

The gag/pol clone of the invention is a DNA construct of the gag/pol 
region of HIV which has had the inhibitory/instability regions removed. The 
construct is expected to be useful as a component of a new type of lentivirus vector 
for use in gene therapy or as a vaccine. 

The gag, pol or gag/pol sequences of the invention can be highly 
expressed in human and other mammalian cells in the absence of any other 
regulatory and structural protein of HIV, including Rev. When the gag/pol 
sequences are combined with a sequence encoding an envelope protein, such as the 
VSV G protein or the HIV envelope protein (e.g., in the same vector or in another 
expression vector), infectious virus is produced after transfection into human cells. 
When a gene encoding a non-HIV envelope protein is used, for example, in the 
presence of the HIV gag/pol gene, the virus particles produced would contain only 
the HIV proteins Gag and Pol. 

Lentiviral vectors or vector systems based on the gag, pol or gag/pol 
sequences of this invention, as exemplified by the Rev-independent 
pCMVgagpolBNkan construct described herein, may be used for gene therapy in vivo 
(e.g., parenteral inoculation of high titer vector) or ex vivo (e.g., in vitro transduction 
of patient's cells followed by reinfusion into the patient of the transduced cells). These 
procedures have already been used in different approved gene therapy protocols. 

The HIV gag/pol clone and SIV gag clone of the invention were 
made using the method for eliminating inhibitory/instability regions from a gene as 
described in U.S. Patent No. 6,174,666, and also in related U.S. Patent Nos. 
5,972,596 and 5,965,726, which are incorporated by reference herein. This method 
does not require the identification of the exact location or knowledge of the 
mechanism of function of the INS. Generally, the mutations are such that the amino 
acid sequence encoded by the mRNA is unchanged, although conservative and non- 
conservative amino acid substitutions are also envisioned where the protein encoded 
by the mutated gene is substantially similar to the protein encoded by the 
non-mutated gene. The mutated genes can be synthetic (e.g., synthesized by chemical 
synthesis), semi-synthetic (e.g., a combination of genomic DNA, cDNA, or 
PCR-amplified DNA and synthetic DNA), or recombinantly produced. The genes also 
may optionally not contain introns. The nucleic acids of the invention may also 
contain Rev-independent fragments of these genes which retain the desired function 
(e.g., for antigenicity of Gag or Pol, particle formation (Gag) or enzymatic activity 
(Pol)), or they may also contain Rev-independent variants which have been mutated 
so that the encoded protein loses a function that is unwanted in certain 
circumstances. In the latter case, for example, the gene may be modified to encode 
mutations (at the amino acid level) in the active site of reverse transcriptase or 
integrase proteins to prevent reverse transcription or integration. Rev-independent 
fragments of the gag gene are described in U.S. patent application Serial No. 
07/858,747, filed March 27, 1992, and also in related U.S. Patent Nos. 5,972,596 
and 5,965,726, which are incorporated by reference herein. If desired, the ATG 
initiation codon of any HIV accessory gene (e.g., vif), if present, may also be 
mutated. 
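
As an illustration of the principle that the mutations generally leave the 
encoded amino acid sequence unchanged, the following minimal Python sketch 
checks that a mutated coding sequence translates to the same protein as the 
unmutated sequence and lists the nucleotide positions that differ. The two short 
sequences and the function names are hypothetical and are used only for 
illustration; they are not the sequences or the mutations of the invention. 

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in the conventional T, C, A, G order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def silent_changes(wild_type: str, mutated: str) -> list:
    """Return the 0-based positions that differ between the two sequences,
    after confirming that both encode the same amino acid sequence."""
    if len(wild_type) != len(mutated):
        raise ValueError("sequences must have the same length")
    if translate(wild_type) != translate(mutated):
        raise ValueError("mutations alter the encoded amino acid sequence")
    pairs = zip(wild_type.upper(), mutated.upper())
    return [i for i, (a, b) in enumerate(pairs) if a != b]


# Hypothetical six-codon fragments, for illustration only.
wild_type = "ATGGGTGCGAGAGCGTCA"
mutated = "ATGGGCGCCCGCGCCAGC"

print(translate(wild_type))
print(silent_changes(wild_type, mutated))
```

A check of this kind only confirms that a set of nucleotide changes is silent at 
the protein level; it says nothing about whether the changes remove an 
inhibitory/instability region, which is determined experimentally. 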

In addition to being capable of producing HIV Gag and Pol proteins 
in the absence of Rev regulatory protein in a cell in vivo, the HIV gag/pol clone and 
SIV gag clone of the invention are also capable of producing HIV Gag and Pol 
proteins in the absence of any added cis-acting transport element, such as CTE or 
CTE-like elements (collectively referred to herein as RNA Transport Elements (RTE)). 
Experiments indicate that the mutated vectors of the invention for SIV gag are far 
superior to those adding CTE (see Qiu et al., J. Virol. 73:9145-52 (1999)). 

The expression of the proteins encoded by these vectors after 
transfection into human cells may be monitored at both the level of RNA and 
protein production. RNA levels are quantitated by methods known in the art, e.g., 
Northern blots, S1 mapping or PCR methods. Protein levels may also be 
quantitated by methods known in the art, e.g., western blot or ELISA or fluorescent 
detection methods. A fast non-radioactive ELISA protocol can be used to detect 
gag protein (DUPONT or COULTER gag antigen capture assay). 

At least three types of lentiviral vectors based on the gag/pol genes 
of the invention for use in gene therapy and/or as a vaccine are envisioned, i.e., 
lentiviral vectors having 

a) no round of replication (i.e., a zero replication system) 

b) one round of replication 

c) a fully replicating system 

For a system with no round of replication, a gag/pol gene, or separate 
gag and pol genes, or fragments of these genes, are expressed using appropriate 
transcription units, e.g., a CMV promoter and a BGH poly(A) site. This will allow 
expression of the gag/pol unit (or gag or pol or fragment(s) thereof) for vaccine 
purposes. This expression can be accomplished without the production of any 
functional retroviral enzymes, provided that the appropriate mutation(s), e.g., a 
missense mutation, are introduced. In a zero replication system, a virus stock will 
be administered to the cells or animals of interest. For example, if one creates and 
uses a virus stock with the exemplified system using the packaging vector 
pCMVgagpolBNkan, the transfer construct pmBCwCNluci or pmBCmCNluci, and 
the envelope containing vector pHCMV-G, one obtains a zero replication system. 
The virus particles produced by such system can infect cells, and the reverse 
transcribed transfer construct DNA will go into the nucleus but, because the coding 
regions for viral structural proteins are not present, there will be no virus expression 
and replication (0 rounds). If one transfects cells in vivo with the same 3 DNAs, 
they will go to the nucleus, express viral proteins, make infectious virus particles 
and go out and infect another cell or cells (1 round). Since in vivo delivery of three 
plasmids may result in lower expression, at least two different embodiments are 
envisioned. In the first, two plasmids may be used, e.g., MV1 shown in Fig. 5 and 
an envelope expression plasmid such as pHCMV-G. Other plasmids encoding 
functional envelopes from HIV, SIV, or other retroviruses can also be used. 
Transfection by the two plasmids results in infectious virus that can infect and 
integrate into new cells (1 round). The infected cells produce gagpol but virus 
propagation is not possible in the absence of env. 

For a system with one round of replication, at least two additional 
embodiments are envisioned. In the first method, a combination of the genes, e.g., a 
gag/pol gene, an env encoding gene and, preferably, a gene encoding a reporter 
protein or other polynucleotide or protein of interest, are delivered into the cells of 
interest in vivo. As discussed above for the exemplified system, if one transfects 
cells in vivo with the same 3 DNAs, they will go to the nucleus, express viral 
proteins, make infectious virus particles, be released and infect another cell or cells 
(1 round). 

In another embodiment, the same result (i.e., only one round of 
replication) can be obtained by using transfer vectors that have deletions in the 3' 
LTR and in which a heterologous promoter (e.g., the CMV promoter, or an inducible 
promoter, or a tissue-specific promoter) is used in place of the 3' LTR promoter. 
The mutations in the 3' LTR make it inactive upon reverse transcription and 
integration. This is because the integrated provirus derives both its 5' LTR and its 3' 
LTR from the 3' LTR of the starting (transfer) construct. (This is a well-known 
property of all retroviruses and has been used to make self-inactivating (SIN) 
vectors.) There are several reasons one may want to inactivate the incoming LTR 
promoter, one of which is to use a different tissue-specific or regulated promoter for 
expression of a gene of interest in the integrated provirus. Note that, with SIN 
vectors, if one uses a viral stock made in vitro after transfection into cells and 
collection of infectious virus, there will be no round of replication. If one transfects 
cells with the DNAs in vivo, there will be one round of replication. If functional 
gag, pol, or env are not included in the DNA mix, there will not be any infection at 
all (i.e., infectious viruses will not be made). 
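
The three cases stated above for the replication-limited systems can be 
summarized in a short Python sketch. The function name, argument names and 
the string labels for the mode of delivery are hypothetical conveniences; the sketch 
encodes only the rules given in the text and is not a model of vector biology. 

```python
def expected_rounds(delivery, has_gag=True, has_pol=True, has_env=True):
    """Return the number of rounds of replication expected in the recipient,
    or None when no infectious virus is made.

    delivery: "virus_stock" for a stock made in vitro and then administered,
              "dna_in_vivo" for direct transfection of the DNAs in vivo.
    has_gag, has_pol, has_env: whether functional copies of these genes
              are included in the DNA mix.
    """
    if not (has_gag and has_pol and has_env):
        # Without functional gag, pol and env, infectious virus is not made.
        return None
    if delivery == "virus_stock":
        # Particles infect cells, but no viral structural genes are
        # transferred, so there is no further virus production (0 rounds).
        return 0
    if delivery == "dna_in_vivo":
        # Transfected cells make infectious particles that infect other
        # cells once (1 round).
        return 1
    raise ValueError("delivery must be 'virus_stock' or 'dna_in_vivo'")


print(expected_rounds("virus_stock"))
print(expected_rounds("dna_in_vivo"))
print(expected_rounds("dna_in_vivo", has_env=False))
```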

A fully replicating Rev-independent system has not been constructed 
yet, although it is expected that a functional system can be constructed using 
Rev-independent gag/pol and env sequences. If desired, extra posttranscriptional control 
elements such as the CTE element, which can replace Rev and give infectious virus 
(see, e.g., Zolotukhin et al., J. Virol. 68:7944-7952 (1994)), are included. The fully 
replicating system should be in one piece, containing the LTR, packaging signal, 
gag/pol, splice site, env, tat, one or more CTE or CTE-like elements (if desired for 
optimal results), and LTR. Tat is thought to be required in this construct, at least in 
non-permissive cells. Such a system is depicted in Figure 5 (construct MV2). In 
this system, a cell or animal of interest (preferably human) would be infected with 
virus stock that then propagates. CTE or CTE-like elements (depicted in construct 
MV2 as RTE (RNA Transport Elements)) are desirable since they have been shown 
to improve expression, and since many retroviruses require the presence of 
posttranscriptional control elements. There are several types of CTE and CTE-like 
elements, and these elements appear to work via a different pathway from the 
Rev-RRE pathway. See, e.g., Tabernero et al., J. Virol. 71:95-101 (1997). See also, 
Pavlakis and Nappi, PCT/US99/11082, filed May 22, 1999, published as WO 
99/61596 on December 2, 1999 (and incorporated herein by reference), which 
describes a new type of post-transcriptional control element that is able to replace 
CTE and HIV RRE/Rev. The Pavlakis-Nappi element does not work in the same 
way as CTE and does not have any sequence or structure homology. 

In a preferred embodiment, a lentiviral system of the invention 
comprises the following three components: 

1. a packaging vector containing nucleic acid sequences 
encoding the elements necessary for vector packaging such as 
structural proteins (except for HIV env) and the enzymes 
required to generate vector particles, the packaging vector 
comprising at least a mutated HIV or SIV gag/pol gene of the 
invention; 

2. a transfer vector containing genetic cis-acting sequences 
necessary for the vector to infect the target cell and for 
transfer of the therapeutic or reporter or other gene(s) of 
interest, the transfer vector comprising the encapsidation 
signal and the gene(s) of interest or a cloning site for inserting 
the gene(s) of interest; and 

3. a vector containing sequences encoding an element necessary 
for targeting the viral particle to the intended recipient cell, 
preferably the gene encoding the G glycoprotein of the 
vesicular stomatitis virus (VSV-G) or amphotropic MuLV or 
lentiviral envs. 
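
The three-component arrangement listed above can be written down as a small 
data structure with a completeness check. The following Python sketch is only a 
bookkeeping aid under stated assumptions: the class, the role labels and the element 
labels are hypothetical, and the element annotations given for the exemplified 
plasmids (pCMVgagpolBNkan, pmBCwCNluci and pHCMV-G) are simplified 
for illustration rather than taken from their maps. 

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plasmid:
    name: str
    role: str            # "packaging", "transfer" or "envelope"
    elements: frozenset  # simplified, illustrative feature labels


# Elements each role must supply, following the three-component list.
REQUIRED = {
    "packaging": {"gag/pol"},
    "transfer": {"encapsidation signal"},
    "envelope": {"env"},
}
# The transfer vector carries either the gene(s) of interest or a cloning site.
ONE_OF = {"transfer": {"gene of interest", "cloning site"}}


def missing_components(plasmids):
    """Return a dict mapping each role to the set of elements still missing."""
    supplied = {role: set() for role in REQUIRED}
    for plasmid in plasmids:
        if plasmid.role not in REQUIRED:
            raise ValueError(f"unknown role: {plasmid.role}")
        supplied[plasmid.role] |= plasmid.elements
    missing = {}
    for role, needed in REQUIRED.items():
        gap = needed - supplied[role]
        if gap:
            missing[role] = gap
    for role, options in ONE_OF.items():
        if not (options & supplied[role]):
            missing.setdefault(role, set()).add(" or ".join(sorted(options)))
    return missing


system = [
    Plasmid("pCMVgagpolBNkan", "packaging", frozenset({"gag/pol"})),
    # The reporter is treated here as the gene of interest (an assumption).
    Plasmid("pmBCwCNluci", "transfer",
            frozenset({"encapsidation signal", "gene of interest"})),
    Plasmid("pHCMV-G", "envelope", frozenset({"env"})),
]

print(missing_components(system))      # complete system
print(missing_components(system[:2]))  # envelope vector left out
```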

Using the CMV promoter or other strong, high-efficiency promoter 
instead of the HIV-1 LTR promoter in the packaging vector, high expression of gag, 
pol, or gag/pol can be achieved in the total absence of any other viral protein. The 
exchange of the HIV-1 LTR promoter with other promoters is beneficial in the 
packaging vector or other vectors if constitutive expression is desirable and also for 
expression in other mammalian cells, such as mouse cells, in which the HIV-1 
promoter is weak. Vectors containing the sequences of the invention can be used 
for the Rev-independent production of HIV-1 Gag/Pol, HIV-1 Gag, HIV-1 Pol, and 
SIV Gag proteins. In certain embodiments, the presence of heterologous promoters 
will also be desired in the transfer vector and the envelope encoding vector, when 
such vectors are used. 

The gene(s) of interest are chosen according to the effect sought to be 
achieved. For gene therapy purposes there will be at least one therapeutic gene 
encoding a gene product which is active against the condition it is desired to treat or 
prevent. Alternatively or additionally, there may be a gene which acts as a marker 
by encoding a detectable product. Therapeutic genes may encode, for example, an 
anti-sense RNA, a ribozyme, a transdominant negative mutant of a target protein, a 
toxin, a conditional toxin, an antigen that induces antibodies or helper T-cells or 
cytotoxic T-cells, a single chain antibody or a tumor suppressor protein. See, e.g., 
WO 98/17816. 

An even more extensive list of genes of interest for use in lentiviral 
vectors is described, e.g., in WO 99/04026 on page 10, line 20 to page 12, line 7. 
Table 2 of Klimatcheva et al. (1999) also provides a list of disorders and target cells 
for gene therapy, as well as a number of lentiviral vectors used by others. This list 
includes genetic/metabolic deficiencies, viral infection and cancer. Inherited 
genetic defects such as adenosine deaminase deficiency, familial 
hypercholesterolemia, cystic fibrosis, mucopolysaccharidosis type VII, types I and 
II diabetes, classical phenylketonuria and Gaucher disease are diseases which are 
listed as being possible to overcome by lentiviral vector-mediated gene therapy 
because they constitute single-gene deficiencies for which the involved genes are 
known. Viral diseases are also listed as constituting appropriate targets for lentiviral 
gene delivery. In particular, a number of gene therapy approaches have been 
proposed for the treatment of HIV infection and, for some of these strategies, phase 
I studies have recently begun in humans. The article states that preliminary studies 
have dealt with defective murine oncoviruses for delivery of anti-sense RNAs, 
ribozymes and trans-dominant proteins against HIV replication. 

In any of the vectors, but preferably in the transfer vector, an inserted 
gene could have an internal ribosomal entry site (IRES), e.g., from picornaviral 
RNA. An IRES will be used in circumstances where one wants to express two 
proteins from the same promoter, for example, one protein of interest and a marker 
gene, e.g., green fluorescent protein (GFP), or a marker gene and a drug resistance 
gene (e.g., the firefly luciferase gene and neomycin phosphotransferase gene) as 
described on p. 58 of WO 99/04026. Using an IRES, the expression of 
the two proteins is coordinated. A further gene or genes may also be present under 
the control of a separate promoter. Such a gene may encode for example a 
selectable marker, or a further therapeutic agent which may be among the 
therapeutic agents listed above. Expression of this gene may be constitutive; in the 
case of a selectable marker this may be useful for selecting successfully transfected 
packaging cells, or packaging cells which are producing particularly high titers of 
the retroviral vector particles. Alternatively or additionally, the selectable marker 
may be useful for selecting cells which have been successfully infected with the 
lentiviral vector and have the provirus integrated into their own genome. 

One way of performing gene therapy is to extract cells from a 
patient, infect the extracted cells with a lentiviral vector and reintroduce the cells 
back into the patient. A selectable marker may be used to provide a means for 
enriching for infected or transduced cells or positively selecting for only those cells 
which have been infected or transduced, before reintroducing the cells into the 
patient. This procedure may increase the chances of success of the therapy. 
Selectable markers may be for instance drug resistance genes, metabolic enzyme 
genes, or any other selectable markers known in the art. Typical selection genes 
encode proteins that confer resistance to antibiotics and other toxic substances, e.g., 
histidinol, puromycin, hygromycin, neomycin, methotrexate etc. and cell surface 
markers. 

However, it will be evident that for many gene therapy applications 
of lentiviral vectors, selection for expression of a marker gene may not be possible 
or necessary. Indeed expression of a selection marker, while convenient for in vitro 
studies, could be deleterious in vivo because of the inappropriate induction of 
cytotoxic T lymphocytes (CTLs) directed against the foreign marker protein. Also, 
it is possible that for in vivo applications, vectors without any internal promoters 
will be preferable. The presence of internal promoters can affect for example the 
transduction titres obtainable from a packaging cell line and the stability of the 
integrated vector. Thus, single transcription unit vectors, which may be bi-cistronic 
or poly-cistronic, coding for one or two or more therapeutic genes, may be the 
preferred vector designed for use in vivo. See, e.g., WO 98/17816. 

Suitable host or producer cells for use in the invention are well 
known in the art. Many lentiviruses have already been split into replication-defective 
genomes and packaging components. For those which have not, the technology is 
available for doing so. The producer cell encodes the viral components not encoded 
by the vector genome such as the Gag, Pol and Env proteins. The gag, pol and env 
genes may be introduced into the producer cell transiently, or may be stably 
integrated into the cell genome to give a packaging cell line. The lentiviral vector 
genome is then introduced into the packaging cell line by transfection or 
transduction to create a stable cell line that has all of the DNA sequences required to 
produce a lentiviral vector particle. Another approach is to introduce the different 
DNA sequences that are required to produce a lentiviral vector particle, e.g., the env 
coding construct, the gag-pol coding construct and the transfer construct, into the cell 
simultaneously by transient triple transfection. 

Target cells identified by Klimatcheva et al. (1999), and the 
references cited therein, include airway epithelial cells for cystic fibrosis; retinal 
photoreceptor cells for retinitis pigmentosa; progenitors for red blood cells, 
macrophages, and lymphocytes for hematopoietic disorders, sickle cell anemia, 
β-thalassemia, lysosomal storage disorders, mucopolysaccharidoses, and severe 
combined immunodeficiency syndrome; bone marrow cells and macrophages for 
Gaucher's disease; liver cells for familial hypercholesterolaemia; T-lymphocytes 
and macrophages for HIV infection; brain tissue, neurons, and glial cells for 
neurodegenerative diseases such as Parkinson's and Alzheimer's diseases; 
endothelial cells and cardiac myocytes for cardiovascular diseases; and cancer cells 
in various tissues (e.g. liver or brain) for cancer. Target cells for other diseases 
would be apparent to one of skill in the art. 

Vaccines and pharmaceutical compositions comprising at least one of 
the nucleic acid sequences, vectors, vector systems, or transduced or transfected 
host cells of the invention and a physiologically acceptable carrier are also part of 
the invention. 

As used herein, the term "transduction" generally refers to the 
transfer of genetic material into the host via infection, e.g., in this case by the 
lentiviral vector. The term "transfection" generally refers to the transfer of isolated 
genetic material into cells via the use of specific transfection agents (e.g., calcium 
phosphate, DEAE Dextran, lipid formulations, gold particles, and other 
microparticles) that cross the cytoplasmic membrane and deliver some of the 
genetic material into the cell nucleus. 

Systems similar to those described herein can be produced using 
elements of lentiviruses in addition to the HIV and/or SIV genes described herein. 

Pharmaceutical Compositions 

The pharmaceutical compositions of the invention contain a 
pharmaceutically and/or therapeutically effective amount of at least one nucleic acid 
construct, vector, vector system, viral particle/virus stock, or host cell (i.e., agents) 
of the invention. In one embodiment of the invention, the effective amount of an 
agent of the invention per unit dose is an amount sufficient to cause the detectable 
expression of the gene of interest. In another embodiment of the invention, the 
effective amount of agent per unit dose is an amount sufficient to prevent, treat or 
protect against deleterious effects (including severity, duration, or extent of 
symptoms) of the condition being treated. The effective amount of agent per unit 
dose depends, among other things, on the species of mammal inoculated, the body 
weight of the mammal and the chosen inoculation regimen, as is well known in the 
art. The dosage of the therapeutic agents which will be most suitable for 
prophylaxis or treatment will also vary with the form of administration, the 
particular agent chosen and the physiological characteristics of the particular patient 
under treatment. The dose is administered at least once. Subsequent doses may be 
administered as indicated. 

To monitor the response of individuals administered the 
compositions of the invention, mRNA or protein expression levels may be 
determined. In many instances it will be sufficient to assess the expression level in 
serum or plasma obtained from such an individual. Decisions as to whether to 
administer another dose or to change the amount of the composition administered to 
the individual may be at least partially based on the expression levels. 

The term "unit dose" as it pertains to the inocula refers to physically 
discrete units suitable as unitary dosages for mammals, each unit containing a 
predetermined quantity of active material (e.g., nucleic acid, virus stock or host cell) 
calculated to produce the desired effect in association with the required diluent. The 
titers of the virus stocks to be administered to a cell or animal will depend on the 
application and on type of delivery (e.g., in vivo or ex vivo). The virus stocks can be 
concentrated using methods such as centrifugation. The titers to be administered ex 
vivo are preferably in the range of 0.001 to 1 infectious unit/cell. Another method 
of generating viral stocks is to cocultivate stable cell lines expressing the virus with 
the target cells. This method has been used to achieve better results when using 
traditional retroviral vectors because the cells can be infected over a longer period 
of time and they have the chance to be infected with multiple copies of the vector. 
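
The ratio of infectious units to cells stated above translates into a volume of 
virus stock by simple arithmetic: volume = (number of cells x infectious units per 
cell) / titer. The following Python sketch illustrates the calculation. The function 
names and the example values for cell number, titer and ratio are assumptions 
chosen for illustration and are not taken from the examples herein. 

```python
PREFERRED_MIN_IU_PER_CELL = 0.001
PREFERRED_MAX_IU_PER_CELL = 1.0


def in_preferred_range(iu_per_cell):
    """True if the ratio lies within 0.001 to 1 infectious unit/cell."""
    return PREFERRED_MIN_IU_PER_CELL <= iu_per_cell <= PREFERRED_MAX_IU_PER_CELL


def stock_volume_ml(n_cells, iu_per_cell, titer_iu_per_ml):
    """Volume of virus stock (mL) that delivers the requested number of
    infectious units per cell to n_cells target cells."""
    if n_cells <= 0 or iu_per_cell <= 0 or titer_iu_per_ml <= 0:
        raise ValueError("all inputs must be positive")
    return n_cells * iu_per_cell / titer_iu_per_ml


# Assumed example values: 1e6 target cells, 0.1 infectious unit/cell,
# and a stock titer of 1e7 infectious units/mL.
volume = stock_volume_ml(n_cells=1e6, iu_per_cell=0.1, titer_iu_per_ml=1e7)
print(f"{volume:.3f} mL; within preferred range: {in_preferred_range(0.1)}")
```

If the computed volume is impractically large for the culture, the stock can be 
concentrated (e.g., by centrifugation, as noted above) to raise the titer. 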

For in vivo administration of nucleic acid constructs, vectors, vector 
systems, virus stocks, or cells which have been transduced or transfected ex vivo, the 
dose is to be determined by dose escalation, with the upper dose being limited by 
the onset of unacceptable adverse effects. Preliminary starting doses may be 
extrapolated from experiments using lentiviral vectors in animal models, by 
methods known in the art, or may be extrapolated from comparisons with known 
viral vector (e.g., adenoviral) doses. Generally, small dosages will be used initially 
and, if necessary, will be increased by small increments until the optimum effect 
under the circumstances is reached. Exemplary dosages are within the range of 10^8 
up to approximately 5 x 10^15 particles. 

Inocula are typically prepared as a solution in a physiologically 
acceptable carrier such as saline, phosphate-buffered saline and the like to form an 
aqueous pharmaceutical composition. 

The agents of the invention are generally administered with a 
physiologically acceptable carrier or vehicle therefor. A physiologically acceptable 
carrier is one that does not cause an adverse physical reaction upon administration 
and one in which the nucleic acids are sufficiently soluble to retain their activity to 
deliver a pharmaceutically or therapeutically effective amount of the compound. 
The pharmaceutically or therapeutically effective amount and method of 
administration of an agent of the invention may vary based on the individual patient, 
the indication being treated and other criteria evident to one of ordinary skill in the 
art. A therapeutically effective amount of a nucleic acid of the invention is one 
sufficient to prevent, or attenuate the severity, extent or duration of the deleterious 
effects of the condition being treated without causing significant adverse side 
effects. The route(s) of administration useful in a particular application are apparent 
to one of ordinary skill in the art. 

Routes of administration of the agents of the invention include, but 
are not limited to, parenteral administration and direct injection into an affected site. 
Parenteral routes of administration include, but are not limited to, intravenous, 
intramuscular, intraperitoneal and subcutaneous. The route of administration of the 
agents of the invention is typically parenteral and is preferably into the bone marrow, 
into the CSF, intramuscular, subcutaneous, intradermal, intraocular, intracranial, 
intranasal, and the like. See, e.g., WO 99/04026 for examples of formulations and 
routes of administration. 

The present invention includes compositions of the agents described 
above, suitable for parenteral administration including, but not limited to, 
pharmaceutically acceptable sterile isotonic solutions. Such solutions include, but 
are not limited to, saline and phosphate buffered saline for nasal, intravenous, 
intramuscular, intraperitoneal, subcutaneous or direct injection into a joint or other 
area. 

In providing the agents of the present invention to a recipient 
mammal, preferably a primate, most preferably a human, the dosage administered 
will vary depending upon such factors as the mammal's age, weight, height, sex, 
general medical condition, previous medical history and the like. 

The administration of the pharmaceutical compositions of the 
invention may be for either "prophylactic" or "therapeutic" purpose. When 
provided prophylactically, the compositions are provided in advance of any 
symptom. The prophylactic administration of the composition serves to prevent or 
ameliorate any subsequent deleterious effects (including severity, duration, or extent 
of symptoms) of the condition being treated. When provided therapeutically, the 
composition is provided at (or shortly after) the onset of a symptom of the condition 
being treated. 

For all therapeutic, prophylactic and diagnostic uses, one or more of 
the agents of the invention, as well as antibodies and other necessary reagents and 
appropriate devices and accessories, may be provided in kit form so as to be readily 
available and easily used. 

Where immunoassays are involved, such kits may contain a solid 
support, such as a membrane (e.g., nitrocellulose), a bead, sphere, test tube, rod, and 
so forth, to which a receptor such as an antibody specific for the target molecule will 
bind. Such kits can also include a second receptor, such as a labeled antibody. Such 
kits can be used for sandwich assays to detect toxins. Kits for competitive assays 
are also envisioned. 

VI. INDUSTRIAL APPLICABILITY 

Mutated genes of this invention can be expressed in the native host 
cell or organism or in a different cell or organism. The mutated genes can be 
introduced into a vector such as a plasmid, cosmid, phage, virus or mini- 
chromosome and inserted into a host cell or organism by methods well known in the 
art. In general, the mutated genes or constructs containing these mutated genes can 
be utilized in any cell, either eukaryotic or prokaryotic, including mammalian cells 
(e.g., human (e.g., HeLa), monkey (e.g., Cos), rabbit (e.g., rabbit reticulocytes), rat, 
hamster (e.g., CHO and baby hamster kidney cells) or mouse cells (e.g., L cells)), 
plant cells, yeast cells, insect cells or bacterial cells (e.g., E. coli). The vectors 
which can be utilized to clone and/or express these mutated genes are the vectors 
which are capable of replicating and/or expressing the mutated genes in the host cell 
in which the mutated genes are desired to be replicated and/or expressed. See, e.g., 
F. Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing 
Associates and Wiley-Interscience (1992) and Sambrook et al. (1989) for examples 
of appropriate vectors for various types of host cells. The native promoters for such 
genes can be replaced with strong promoters compatible with the host into which 
the gene is inserted. These promoters may be inducible. The host cells containing 
these mutated genes can be used to express large amounts of the protein useful in 
enzyme preparations, pharmaceuticals, diagnostic reagents, vaccines and 
therapeutics. 

Mutated genes or constructs containing the mutated genes may also 
be used for in-vivo or in-vitro gene therapy. For example, a mutated gene of the 
invention will produce an mRNA in situ to ultimately increase the amount of protein 
expressed. Such genes include viral genes and/or cellular genes. Such a mutated 
gene is expected to be useful, for example, in the development of a vaccine and/or 
genetic therapy. 

The constructs and/or proteins made by using constructs encoding 
the mutated gag, env, and pol genes could be used, for example, in the production of 
diagnostic reagents, vaccines and therapies for AIDS and AIDS related diseases. 
The inhibitory/instability elements in the HIV-1 gag gene may be involved in the 
establishment of a state of low virus production in the host. HIV-1 and the other 
lentiviruses cause chronic active infections that are not cleared by the immune 
system. It is possible that complete removal of the inhibitory/instability sequence 
elements from the lentiviral genome would result in constitutive expression. This 
could prevent the virus from establishing a latent infection and escaping immune 
system surveillance. The success in increasing expression of the entire gag/pol gene 
by eliminating the inhibitory sequence element suggests that one could produce 
lentiviruses without any negative elements. Such lentiviruses could provide a novel 
approach towards attenuated vaccines. 

For example, vectors expressing high levels of Gag can be used in 
immunotherapy and immunoprophylaxis, after expression in humans. Such vectors 
include retroviral vectors and also include direct injection of DNA into muscle cells 
or other receptive cells, resulting in the efficient expression of gag, using the 
technology described, for example, in Wolff et al., Science 247:1465-1468 (1990), 
Wolff et al., Human Molecular Genetics 1(6):363-369 (1992) and Ulmer et al., 
Science 259:1745-1749 (1993). Further, the gag constructs could be used in 
transdominant inhibition of HIV expression after the introduction into humans. For 
this application, for example, appropriate vectors or DNA molecules expressing 
high levels of p55 gag or p37 gag would be modified to generate transdominant gag 
mutants, as described, for example, in Trono et al., Cell 59:113-120 (1989). The 
vectors would be introduced into humans, resulting in the inhibition of HIV 
production due to the combined mechanisms of gag transdominant inhibition and of 
immunostimulation by the produced gag protein. In addition, the gag constructs of 
the invention could be used in the generation of new retroviral vectors based on the 
expression of lentiviral gag proteins. Lentiviruses have unique characteristics that 
may allow the targeting and efficient infection of non-dividing cells. Similar 
applications are expected for vectors expressing high levels of env. 

Identification of similar inhibitory/instability elements in SIV 
indicates that this virus is a convenient model to test these hypotheses. SIV 
similarly modified could be used in place of HIV in an effort to further minimize the 
possibility of rearrangement events that would lead to the generation of infectious 
HIV. 

The following examples illustrate certain embodiments of the present 
invention, but should not be construed as limiting its scope in any way. Certain 
modifications and variations will be apparent to those skilled in the art from the 
teachings of the foregoing disclosure and the following examples, and these are 
intended to be encompassed by the spirit and scope of the invention. 

EXAMPLE 1 

Rev-Independent HIV-1 Gag/Pol Molecular Clone 

Figure 1 shows the DNA sequence of a Rev-independent HIV-1 
gag/pol molecular clone. This DNA sequence shown encodes the complete Gag and 
Pol of HIV-1 and can be expressed in a Rev-independent manner when operably 
linked to a promoter. The Rev-independent gag sequence was described in U.S. 
Patent Nos. 6,174,666, 5,972,596 and 5,965,726 and the Rev-independent pol 
sequence was generated by eliminating the inhibitory/instability sequences using the 
methods described in those patents. Others have reportedly made Rev independent 
gag sequences by optimizing codon usage for human cells (see, e.g., WO 
98/34640). 

Figure 2 shows an alignment of the sequence of the wild-type and 
mutated pol region in pCMVgagpolBNkan. Position #1 in the figure is position 
2641 in plasmid pCMVgagpolBNkan. 
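
The relationship between the Figure 2 numbering and the plasmid numbering can be written as a simple offset. The following is a minimal sketch only: the function names are hypothetical, and it assumes that the alignment in Figure 2 is numbered contiguously from position 1 with no gaps in the plasmid sequence.

```python
# Position #1 in Figure 2 corresponds to position 2641 in pCMVgagpolBNkan
# (stated in the text). Everything else here is an illustrative assumption.
FIGURE_2_START_IN_PLASMID = 2641


def figure_to_plasmid(figure_pos: int) -> int:
    """Convert a 1-based Figure 2 position to a pCMVgagpolBNkan position."""
    if figure_pos < 1:
        raise ValueError("Figure positions are 1-based")
    return figure_pos + FIGURE_2_START_IN_PLASMID - 1


def plasmid_to_figure(plasmid_pos: int) -> int:
    """Convert a pCMVgagpolBNkan position back to a 1-based Figure 2 position."""
    if plasmid_pos < FIGURE_2_START_IN_PLASMID:
        raise ValueError("Position lies upstream of the region shown in Figure 2")
    return plasmid_pos - FIGURE_2_START_IN_PLASMID + 1
```

The two functions are inverses of each other, so a position read off the alignment can be checked against the plasmid sequence and vice versa.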

The elimination of INS in gag, pol and env regions allows the 
expression of high levels of authentic HIV-1 structural proteins in the absence of the 
Rev regulatory factor of HIV-1. 
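
One way to check that a mutated sequence still encodes the authentic protein is to translate the wild-type and mutated coding sequences and compare the results. The sketch below assumes that the INS were removed by nucleotide changes that leave the encoded amino acids unchanged, as in the approach of the patents cited above. The function names and the short toy sequences are hypothetical and are not taken from the figures.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence in frame 1; a trailing partial codon is ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def is_silent_variant(wild_type: str, mutant: str) -> bool:
    """True if both sequences have the same length and encode the same protein."""
    return len(wild_type) == len(mutant) and translate(wild_type) == translate(mutant)


def count_changes(wild_type: str, mutant: str) -> int:
    """Number of positions at which the two sequences differ."""
    return sum(a != b for a, b in zip(wild_type.upper(), mutant.upper()))


if __name__ == "__main__":
    # Hypothetical toy sequences, three codons each (not from the figures).
    wild_type_toy = "GGTGCGAGA"
    mutant_toy = "GGCGCCCGC"
    print(translate(wild_type_toy), translate(mutant_toy))
    print(is_silent_variant(wild_type_toy, mutant_toy))
    print(count_changes(wild_type_toy, mutant_toy))
```

Applied to a full-length pair such as the wild-type and mutated pol regions aligned in Figure 2, the same comparison would flag any change that alters the encoded protein.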

EXAMPLE 2 

Rev-Independent SIV Gag Molecular Clone 

Figure 3 shows the DNA sequence of a Rev-independent SIV gag 
molecular clone, SIVgagDX. Figure 4 shows the comparison of wild type (WT) 
and mutant (SIVgagDX) sequences. The wild type SIV sequence is from Simian 
(macaque) immunodeficiency virus isolate 239, clone lambda siv 239-1 (GenBank 
accession No. M33262). 

EXAMPLE 3 

Rev-Independent SIV Env Molecular Clone 

Figure 16 shows a schematic diagram, and figure 17 shows the DNA 
sequence, of the "env-coding" vector CMVkan/R-R-SIVgp160CTE, which is an 
example of a vector comprising a mutated lentiviral env gene sequence which is 
capable of being expressed independently of any SIV or HIV regulatory factors. 
"CMV" denotes the cytomegalovirus promoter; "SRV-CTE" denotes the 
constitutive transport element (CTE) of Simian Retrovirus Type 1; "all-STOP" 
denotes a sequence providing translational stops in all three reading frames; "BGH 
terminator" denotes the bovine growth hormone polyadenylation signal. Other 
posttranscriptional control elements can be used instead of the indicated SRV-CTE, 
for example the one described by Pavlakis and Nappi, PCT/US99/11082, filed May 
22, 1999, which was published as WO 99/61596 on December 2, 1999 (and which 
is incorporated herein by reference). 

As mentioned previously above, such a vector encoding a lentiviral 
env gene may be used if it is desired that the vector infect CD4 + T cells. Also as 
mentioned previously above, the CTE element (i.e., the SRV-CTE element in the 
case of vector CMVkan/R-R-SIVgp160CTE), can be replaced with another 
post-transcriptional control element, such as the Pavlakis-Nappi element, that is able to 
replace CTE and HIV RRE/Rev. See Pavlakis and Nappi, PCT/US99/11082, filed 
May 22, 1999, which was published as WO 99/61596 on December 2, 1999 (and 
which is incorporated herein by reference). 

EXAMPLE 4 

Lentiviral Vector System 

Figure 5 is a schematic of some of the components of a preliminary 
version of the Rev-independent lentiviral vector system exemplified herein, 
including a packaging construct and three different transfer vectors which may be 
used. In the lentiviral system exemplified herein, the packaging construct also 
contains the gene for kanamycin resistance. The lentiviral system exemplified 
herein also contains the vector pHCMV-G, which is shown in Figure 5. 

In the packaging construct shown in Figure 5, "CMV" denotes the 
cytomegalovirus promoter, "Gag" denotes the gag gene, which generates 
components of the virion core, "Pro" denotes "protease," "RT" denotes "reverse 
transcriptase," "Int" denotes "integrase" and "BGH poly (A)" denotes the bovine 
growth hormone polyadenylation signal. The protease, reverse transcriptase, and 
integrase genes comprise the "pol" gene. In transfer construct 1, "LTR" denotes the 
HIV "long terminal repeat", which contains a HIV promoter; "mSD" denotes 
"mutated splice donor site," which is present in the construct so that splicing of the 
RNA transcript does not occur; "Ψ" denotes the encapsidation signal; "wGA" 
denotes part of the wild-type gag gene which contains sequences believed to be 
necessary for encapsidation; "X" indicates that the ATG codon of the partial gag 
gene sequence is mutated so that translation of this gene does not occur; "CMV" 
denotes the cytomegalovirus promoter and luciferase is used as a reporter gene. 
Luciferase can be replaced with any gene of interest. Another HIV LTR is present 
at the 3' end of transfer construct 1. Replacement of this LTR in constructs such as 
the transfer construct 1, 2, or 3 with a promoter-enhancer deleted HIV LTR leads to 
inactivation of LTR after integration. Transfer construct 2 is similar to transfer 
construct 1, the difference being that a mutated part of the gag gene (denoted 
"mGa") is used instead of the wild-type part of the gag gene. Transfer construct 3 
(pm2BCwCNluci) has different mutations at the 5' splice site and has an intact ATG 
codon so that translation of part of the mutated gag gene occurs. Transfer construct 
3 also has a 5' CMV promoter instead of a 5' LTR promoter. This construct is 
expressed independent of the presence of HIV Tat protein. The transfer constructs 
expressed from the LTR promoter are partially dependent on Tat protein. In 293 
cells significant expression can be achieved in the absence of Tat. See, e.g., 
Valentin et al., Proc. Natl. Acad. Sci. USA 95:8886-91 (1998). 

EXAMPLE 5 

Generation of Packaging Construct pCMVgagpolBNkan 

Figure 6 shows a schematic map of the packaging construct 
pCMVgagpolBNkan. The nucleotide numbering is that of the HXB2R sequence 
(GenBank accession numbers K03455 and M38432), where +1 is the start of 
transcription. 

The sequence in HIV-1 gag/pol region was mutated in order to 
eliminate all the INS. The fragment from the beginning of gag to the BsrGI site in pol, 
and the fragment KE [KpnI(3700)-EcoRI(4194)], were previously mutated as 
described in Schneider et al., J. Virol. 71: 4892-4903 (1997) and in U.S. Patent Nos. 
6,174,666, 5,972,596 and 5,965,726. 

To generate pCMVgagpolBNkan, three fragments within HIV-1 pol 
region were mutated. They are fragment BP [BsrGI(2207)-PflMI(3032)], fragment 
PK [PflMI(3032)-KpnI(3700)] and fragment EN [EcoRI(4194)-NdeI(4668)]. 
Mutagenesis was performed using a modified version of the method described by 
Ho et al., Gene 77: 51-59 (1989) and DNA shuffling (Zhao and Arnold, Nucl. Acids 
Res. 25(6), 1307-1308 (1997)). Sixteen oligonucleotides extending over the 
complete sequence of the three fragments were designed. Six oligos corresponded 
to fragment BP, six to fragment PK, and four to fragment EN (the oligonucleotides 
ranged from 130 to 195 bases in length; adjacent oligos overlapped by twenty 
nucleotides). Each fragment was assembled in two steps: 

1) PCR; the reaction was carried out in standard pfu buffer with 
10 pmol of each purified big oligo, 0.2 mM of each dNTP and 2.5 u pfu DNA 
polymerase enzyme (Stratagene) in a 50 µl final volume. The PCR program was: 3 
min 96°C followed by 50 cycles of 1 min 94°C, 1 min 55°C, and 1 min + 5 s/cycle 
72°C, ended by 7 min at 72°C. After PCR, the big oligonucleotides were removed 
from the assembled mutated fragment. 

2) The second step was to specifically amplify the assembled 
products with 30 mer primers located at the 5' and 3' end of each mutated fragment. 
One microliter of the assembled PCR product was used as template in a 25-cycle 
PCR reaction with 50 pmol of each primer, 1 x pfu buffer, 0.2 mM of each dNTP 
and 2.5 u pfu DNA polymerase in a 50 µl final volume. The PCR program was: 3 
min 96°C, 10 cycles of 30 s 94°C, 30 s 55°C, 45 s 72°C, followed by another 14 
cycles of 30 s 94°C, 30 s 55°C, 45 s + 20 s/cycle 72°C, and finally 7 min 72°C. 
This program gave a single PCR product of the correct size. The amplified BP, PK 
and EN fragments were individually cloned into PCR-script™ vector using the 
PCR-script™ Amp SK(+) Cloning Kit (Stratagene). Clones were randomly selected and 
sequenced. The correct BP, PK and EN fragments, together with fragment KE 
previously mutated by Schneider et al., were ligated between the BsrGI and KpnI sites of 
p55AM1-R5 (which was previously described in Schneider et al., J. Virol. 
71: 4892-4903 (1997)) to produce a completely mutated gagpol ORF. The new plasmid 
containing the completely mutated gag/pol was named pLTRgagpolBN. BN stands 
for the modification of the fragment between BsrGI and NdeI. The mutated gag/pol 
was then cloned into a CMVkan vector containing the cytomegalovirus major late 
promoter (GenBank accession no. X17403) and the kanamycin resistance gene, 
resulting in pCMVgagpolBNkan. The plasmid backbone comes from pVR1332 
provided by Vical Inc., and described in Hartikka et al., Hum Gene Ther. 7:1205-17 
(1996). 
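
The oligonucleotide design described above (sixteen oligos covering fragments BP, PK and EN, with adjacent oligos overlapping by twenty nucleotides) can be checked with some simple arithmetic. The sketch below is illustrative only. It assumes that each fragment length equals the difference between the restriction site coordinates given in the text and that the oligos tile the fragment end to end; the function name is hypothetical.

```python
# Fragment coordinates and oligo counts as stated in the text:
# name -> (start coordinate, end coordinate, number of oligos).
FRAGMENTS = {
    "BP": (2207, 3032, 6),
    "PK": (3032, 3700, 6),
    "EN": (4194, 4668, 4),
}
OVERLAP = 20  # adjacent oligos overlap by twenty nucleotides


def mean_oligo_length(start: int, end: int, n_oligos: int, overlap: int = OVERLAP) -> float:
    """Average oligo length needed to tile a fragment with fixed overlaps.

    Assumption: fragment length is taken as end - start, ignoring the exact
    geometry of the restriction sites and any flanking bases.
    """
    fragment_length = end - start
    # Each of the (n_oligos - 1) junctions is covered twice, once per neighbour.
    total_oligo_bases = fragment_length + (n_oligos - 1) * overlap
    return total_oligo_bases / n_oligos


if __name__ == "__main__":
    print("total oligos:", sum(n for _, _, n in FRAGMENTS.values()))
    for name, (start, end, n_oligos) in FRAGMENTS.items():
        print(name, end - start, "bp,", round(mean_oligo_length(start, end, n_oligos), 1), "bases/oligo")
```

Under these assumptions the estimated averages fall near the lower end of the stated 130 to 195 base range, which would be consistent with the actual oligos carrying some additional flanking sequence.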

It is understood that different plasmid backbones can be used, e.g., to 
provide good expression in vivo, in the case of DNA injection, for example. 

EXAMPLE 6 

Construction of Transfer Vectors pmBCwCNluci and pmBCmCNluci 

The HIV-1 sequence BC, between BssHII (257) and ClaI (376), 
contains the major splice donor site and the encapsidation signal. Six oligos (33 to 
46 bases) were designed to introduce mutations on the splice donor site and the 
AUG start codon of gag. The BC fragment was assembled, amplified and 
sequenced as described in the section concerning the construction of 
pCMVgagpolBN. 

The mutated BC fragment and a fragment of wild type gag between 
ClaI (376) and NsiI (793) were placed between the BssHII and NsiI sites of p55RRE 
(Schneider et al., J. Virol. 71:4892-4903 (1997)) to generate pmBCwCN. In 
parallel, the fragment between ClaI (376) and NsiI sites of mutated gag from 
p55BM1-10SD+ was used to generate pmBCmCN. (p55BM1-10SD+ is similar to 
p55BM1-10, which is described in Schneider et al. (1997), but contains in addition 
the intact splice donor and encapsidation site upstream of gag). The region between 
NsiI and XhoI containing 3' part of gag and RRE in pmBCwCN and pmBCmCN 
was replaced by a ClaI-XhoI fragment containing CMV promoter and luciferase 
gene from pHR'-CMVluci (vector from D. Trono) to generate pmBCwCNluci and 
pmBCmCNluci (which are shown as transfer constructs 1 and 2 in Figure 5, and 
schematically depicted in Figures 7 and 8, respectively). The sequences of these 
plasmids are shown in Figures 10 and 11, respectively. Different versions of these 
plasmids have also been created, by standard procedures, with variations in the 
region of the encapsidation site, the first splice donor site, and the initiator gag 
AUG. For example, the transfer construct pm2BCwCNluci (which is shown as 
transfer construct 3 in Fig. 5) has different mutations in the 5' splice site region and 
has an intact ATG. A comparison of the sequences in the BssHII-ClaI region of 
transfer constructs 1 and 2 (mBCwCN frag), transfer construct 3 (m2BCwCN frag), 
HXB2 and NL43 is shown in Fig. 12. 

EXAMPLE 7 

Preparation of Viral Particles 

Lentiviral particles were generated by transient cotransfection of 293 
human kidney cells with a combination of three plasmids: pCMVgagpolBNkan, 
pmBCwCNluci or pmBCmCNluci (transfer vector) and pHCMV-G (Yee et al., 
Proc. Natl. Acad. Sci. USA 91:9564-9568 (1994)), a plasmid coding for the 
envelope VSV-G (glycoprotein of vesicular stomatitis virus). 

The day before the transfection, 293 cells were plated at a density of 
10 cells/plate on a 60 mm plate. Plasmid DNA was transfected by the Ca-phosphate 
precipitation method in the following proportions: 3 µg packaging construct, 6 µg 
transfer construct and 100 ng VSV-G encoding construct, pHCMV-G. [Note that 
the LTR promoter can be expressed in 293 cells in the absence of Tat with a 
moderate decrease in efficiency. The transfer constructs can be fully Tat 
independent after replacement of the LTR promoter with a CMV (see, e.g., transfer 
construct 3 in Fig. 5) or other promoter in such a way that the mRNA start site is at 
the beginning of the LTR R region.] In the present experiments for preparation of 
viral particles 500 ng of a Tat expression plasmid was included in the transfection. 
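
The plasmid amounts given above can be tallied to obtain the total DNA per transfection and the share contributed by each plasmid. This is a minimal bookkeeping sketch; the dictionary keys and function names are hypothetical, and the amounts are those stated in the text for one 60 mm plate.

```python
# Amounts per transfection as stated in the text, converted to nanograms.
MIX_NG = {
    "packaging construct (pCMVgagpolBNkan)": 3000,
    "transfer construct": 6000,
    "VSV-G construct (pHCMV-G)": 100,
    "Tat expression plasmid": 500,
}


def total_ng(mix: dict) -> int:
    """Total plasmid DNA in the mix, in nanograms."""
    return sum(mix.values())


def mass_fractions(mix: dict) -> dict:
    """Fraction of the total DNA mass contributed by each plasmid."""
    total = total_ng(mix)
    return {name: amount / total for name, amount in mix.items()}


if __name__ == "__main__":
    print("total DNA (ng):", total_ng(MIX_NG))
    for name, fraction in mass_fractions(MIX_NG).items():
        print(f"{name}: {fraction:.1%}")
```

Keeping the fractions explicit makes it easier to hold the ratio of plasmids constant if a different total amount of DNA is used, although the text itself describes only the single set of amounts above.
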

Cells were washed the day after transfection and were kept in 
DMEM medium for another 48 hours before the supernatants were harvested. 
Supernatants were spun at 1,200 rpm for 7 mins to eliminate any floating cells. 
pCMVgagpolBNkan produces high levels of Gag protein that is efficiently released 
from the cells (Figure 13), and also produces high levels of functional Pol as judged 
by levels of reverse transcriptase activity similar to those found upon expression of 
complete HIV-1 (Figure 14). 

Supernatants from 293 transfected cells were used to transduce 
several human cell lines (293, Jurkat, U937) and non-dividing human primary 
macrophages. 

EXAMPLE 8 

Cell Transduction 

Transduction was performed by incubating for 3-4 hours at 37°C the 
target cells with 1-2 ml of supernatant containing the retroviral vectors. The amount 
of retroviral vector present in the supernatant was normalized by p24 content 
(measured by ELISA). Equal amounts of p24 gag protein were used for infection of 
cells. In this way, differences in production among the different preparations were 
minimized. 
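
The p24 normalization described above amounts to choosing, for each supernatant, the volume that delivers the same quantity of p24. The sketch below illustrates the calculation; the function name, the p24 concentrations and the target amount are hypothetical values chosen for illustration, not measurements from the experiments.

```python
def volumes_for_equal_p24(p24_ng_per_ml: dict, target_p24_ng: float) -> dict:
    """Volume (ml) of each supernatant that contains target_p24_ng of p24.

    p24_ng_per_ml maps a preparation name to its p24 concentration as
    measured by ELISA.
    """
    volumes = {}
    for name, concentration in p24_ng_per_ml.items():
        if concentration <= 0:
            raise ValueError(f"p24 concentration for {name} must be positive")
        volumes[name] = target_p24_ng / concentration
    return volumes


if __name__ == "__main__":
    # Hypothetical ELISA readings (ng p24 per ml of supernatant).
    measured = {"pmBCwCNluci": 50.0, "pmBCmCNluci": 80.0}
    for name, ml in volumes_for_equal_p24(measured, target_p24_ng=50.0).items():
        print(f"{name}: {ml:.3f} ml")
```

In practice the smaller volumes would be brought up with medium so that all target cells are incubated in the same final volume.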

The macrophages used for transduction were isolated from the 
peripheral blood of healthy donors by adherence to plastic. Cells were cultured in 
RPMI + 20% fetal calf serum (FCS) + 10% human serum (HS). After 1 week, non- 
adherent cells were washed off with PBS and the macrophages were kept in culture 
for another 1-2 weeks in the absence of human serum. The cells were washed 2-4 
times with PBS before transduction. 

Cells were harvested 48 hours after transduction (seven days for 
primary macrophages) and the transduction efficiency was determined by measuring 
luciferase activity in cell extracts from the cultures. The results of the transduction 
experiments in 293, Jurkat, U937 and primary macrophages are shown in Figure 
15A-D. These results demonstrate that Rev-independent gag-HIV-1 based 
retroviral vectors display high transduction efficiency in (A) 293 cells, (B) human 
lymphoid cells, (C) human myeloid cells (U937), as well as (D) non-dividing cells 
such as primary human macrophages. 

EXAMPLE 9 

Use Of Nucleic Acids of the Invention 
In Immunoprophylaxis Or Immunotherapy 

In postnatal gene therapy, new genetic information has been 
introduced into tissues by indirect means such as removing target cells from the 
body, infecting them with viral vectors carrying the new genetic information, and 
then reimplanting them into the body; or by direct means such as encapsulating 
formulations of DNA in liposomes; entrapping DNA in proteoliposomes containing 
viral envelope receptor proteins; calcium phosphate co-precipitating DNA; and 
coupling DNA to a polylysine-glycoprotein carrier complex. In addition, in vivo 
infectivity of cloned viral DNA sequences after direct intrahepatic injection with or 
without formation of calcium phosphate coprecipitates has also been described. 
mRNA sequences containing elements that enhance stability have also been shown 
to be efficiently translated in Xenopus laevis embryos, with the use of cationic lipid 
vesicles. See, e.g., J.A. Wolff, et al., Science 247:1465-1468 (1990) and references 
cited therein. 

Recently, it has also been shown that injection of pure RNA or DNA 
directly into skeletal muscle results in significant expression of genes within the 
muscle cells. J.A. Wolff, et al., Science 247: 1465-1468 (1990). Forcing RNA or 
DNA introduced into muscle cells by other means such as by particle-acceleration 
(N.-S. Yang, et al., Proc. Natl. Acad. Sci. USA 87:9568-9572 (1990); S.R. Williams 
et al., Proc. Natl. Acad. Sci. USA 88:2726-2730 (1991)) or by viral transduction 
should also allow the DNA or RNA to be stably maintained and expressed. In the 
experiments reported in Wolff et al., RNA or DNA vectors were used to express 
reporter genes in mouse skeletal muscle cells, specifically cells of the quadriceps 
muscles. Protein expression was readily detected and no special delivery system 
was required for these effects. Polynucleotide expression was also obtained when 
the composition and volume of the injection fluid and the method of injection were 
modified from the described protocol. For example, reporter enzyme activity was 
reported to have been observed with 10 to 100 µl of hypotonic, isotonic, and 
hypertonic sucrose solutions, Opti-MEM, or sucrose solutions containing 2 mM 
CaCl2 and also to have been observed when the 10- to 100-µl injections were 
performed over 20 min. with a pump instead of within 1 min. 

Enzymatic activity from the protein encoded by the reporter gene 
was also detected in abdominal muscle injected with the RNA or DNA vectors, 
indicating that other muscles can take up and express polynucleotides. Low 
amounts of reporter enzyme were also detected in other tissues (liver, spleen, skin, 
lung, brain and blood) injected with the RNA and DNA vectors. Intramuscularly 
injected plasmid DNA has also been demonstrated to be stably expressed in non- 
human primate muscle. S. Jiao et al., Hum. Gene Therapy 3:21-33 (1992). 

It has been proposed that the direct transfer of genes into human 
muscle in situ may have several potential clinical applications. Muscle is 
potentially a suitable tissue for the heterologous expression of a transgene that 
would modify disease states in which muscle is not primarily involved, in addition 
to those in which it is. For example, muscle tissue could be used for the 
heterologous expression of proteins that can immunize, be secreted in the blood, or 
clear a circulating toxic metabolite. The use of RNA and a tissue that can be 
repetitively accessed might be useful for a reversible type of gene transfer, 
administered much like conventional pharmaceutical treatments. See J.A. Wolff, et 
al., Science 247:1465-1468 (1990) and S. Jiao et al., Hum. Gene Therapy 3:21-33 
(1992). 

It had been proposed by J.A. Wolff et al., supra, that the intracellular 
expression of genes encoding antigens might provide alternative approaches to 
vaccine development. This hypothesis has been supported by a recent report that 
plasmid DNA encoding influenza A nucleoprotein injected into the quadriceps of 
BALB/c mice resulted in the generation of influenza A nucleoprotein-specific 
cytotoxic T lymphocytes (CTLs) and protection from a subsequent challenge with a 
heterologous strain of influenza A virus, as measured by decreased viral lung titers, 
inhibition of mass loss, and increased survival. J. B. Ulmer et al., Science 
259:1745-1749 (1993). 

Therefore, it appears that the direct injection of RNA or DNA vectors 
encoding the viral antigen can be used for endogenous expression of the antigen to 
generate the viral antigen for presentation to the immune system without the need 
for self-replicating agents or adjuvants, resulting in the generation of 
antigen-specific CTLs and protection from a subsequent challenge with a homologous or 
heterologous strain of virus. 

CTLs in both mice and humans are capable of recognizing epitopes 
derived from conserved internal viral proteins and are thought to be important in the 
immune response against viruses. By recognition of epitopes from conserved viral 
proteins, CTLs may provide cross-strain protection. CTLs specific for conserved 
viral antigens can respond to different strains of virus, in contrast to antibodies, 
which are generally strain-specific. 

Thus, direct injection of RNA or DNA encoding the viral antigen has 
the advantage of being without some of the limitations of direct peptide delivery or 
viral vectors. (See J. B. Ulmer et al., supra, and the discussions and references 
therein). Furthermore, the generation of high-titer antibodies to expressed proteins 
after injection of DNA indicates that this may be a facile and effective means of 
making antibody-based vaccines targeted towards conserved or non-conserved 
antigens, either separately or in combination with CTL vaccines targeted towards 
conserved antigens. These may also be used with traditional peptide vaccines, for 
the generation of combination vaccines. Furthermore, because protein expression is 
maintained after DNA injection, the persistence of B and T cell memory may be 
enhanced, thereby engendering long-lived humoral and cell-mediated immunity. 
For example, Shriver et al. (1996) describes long-lived T cell responses in mice and 
non-human primates treated with vaccine compositions containing Rev-independent 
HIV gag and HIV gag vectors and Hel et al., J. Immunol, 167:7180-7181 (2001), 
specifically incorporated by reference herein, describes Rev-independent expression 
SIV gag and SIV env vectors and their potentiation of SIV-specific CD4+ and CD8+ 
T cell responses in both naive and infected macaques by a DNA-SIV and NYVAC- 
SIV prime/boost regimen. 

1. Vectors for the immunoprophylaxis or 
immunotherapy against HIV-1 

The mutated gag, pol or gag/pol sequences will be inserted in 
expression vectors using a strong constitutive promoter such as CMV or RSV, or an 
inducible promoter such as HIV-1. 

The vector will be introduced into animals or humans in a 
pharmaceutically acceptable carrier using one of several techniques such as 
injection of DNA directly into human tissues; electroporation or transfection of the 
DNA into primary human cells in culture (ex vivo), selection of cells for desired 
properties and reintroduction of such cells into the body, (said selection can be for 
the successful homologous recombination of the incoming DNA to an appropriate 
preselected genomic region); generation of infectious particles containing the gag 
gene, infection of cells ex vivo and reintroduction of such cells into the body; or 
direct infection by said particles in vivo. 

Substantial levels of protein will be produced leading to an efficient 
stimulation of the immune system. 

In another embodiment of the invention, the described constructs will 
be modified to express mutated Gag proteins that are unable to participate in virus 
particle formation. It is expected that such Gag proteins will stimulate the immune 
system to the same extent as the wild-type Gag protein, but be unable to contribute 
to increased HIV-1 production. This modification should result in safer vectors for 
immunotherapy and immunoprophylaxis. 

EXAMPLE 10 

Inhibition of HIV-1 Expression Using Transdominant 
(TD) TDgag-TDRev or TDgag-pro-TDRev Genes 

Direct injection of DNA or use of vectors other than retroviral 
vectors will allow the constitutive high level of trans-dominant Gag (TDgag) in 
cells. In addition, the approach taken by B.K. Felber et al., Science 239:184-187 
(1988) will allow the generation of retroviral vectors, e.g. mouse-derived retroviral 
vectors, encoding HIV-1 TDgag, which will not interfere with the infection of 
human cells by the retroviral vectors. In the approach of Felber, et al., supra, it was 
shown that fragments of the HIV-1 LTR containing the promoter and part of the 
polyA signal can be incorporated without detrimental effects within mouse 
retroviral vectors and remain transcriptionally silent. The presence of Tat protein 
stimulated transcription from the HIV-1 LTR and resulted in the high level 
expression of genes linked to the HIV-1 LTR. 

The generation of hybrid TDgag-TDRev or TDgag-pro-TDRev genes 
and the introduction of expression vectors in human cells will allow the efficient 
production of two proteins that will inhibit HIV-1 expression. The incorporation of 
two TD proteins in the same vector is expected to amplify the effects of each one on 
viral replication. The use of the HIV-1 promoter in a manner similar to the one 
described in B.K. Felber, et al., supra, will allow high level Gag and Rev expression 
in infected cells. In the absence of infection, expression will be substantially lower. 
Alternatively, the use of other strong promoters will allow the constitutive 
expression of such proteins. This approach could be highly beneficial, because of 
the production of a highly immunogenic gag, which is not able to participate in the 
production of infectious virus, but which, in fact, antagonizes such production. This 
can be used as an efficient immunoprophylactic or immunotherapeutic approach 
against AIDS. 

Examples of trans-dominant mutants are described in Trono et al., 
Cell 59:113-120 (1989). 

1. Generation of constructs encoding transdominant Gag mutant 
proteins 

Gag mutant proteins that can act as trans-dominant mutants, as 
described, for example, in Trono et al., supra, will be generated by modifying vector 
p37M1-10D or p55M1-13PO to produce transdominant Gag proteins at high 
constitutive levels. 

The transdominant Gag protein will stimulate the immune system 
and will inhibit the production of infectious virus, but will not contribute to the 
production of infectious virus. 

The added safety of this approach makes it more acceptable for 
human application. 

Those skilled in the art will recognize that any gene encoding a 
mRNA containing an inhibitory/instability sequence or sequences can be modified 
in accordance with the exemplified methods of this invention or their functional 
equivalents. 

Modifications of the above described modes for carrying out the 
invention that are obvious to those of skill in the fields of genetic engineering, 
virology, immunology, medicine, and related fields are intended to be within the 
scope of the following claims. 

Every reference cited hereinbefore throughout the application is 
hereby incorporated by reference in its entirety. 

WHAT IS CLAIMED IS: 

1. A nucleic acid construct comprising a HIV-1 gag/pol gene 
having the coding sequence of the gag/pol gene set forth in Figure 1. 

2. A nucleic acid construct comprising a HIV-1 pol gene having 
the coding sequence of the pol gene set forth in Figure 2. 

3. A nucleic acid construct comprising a SIV-1 gag gene having 
the coding sequence of the gag gene set forth in Figure 3. 

4. A nucleic acid construct comprising an HIV or SIV 5' LTR, 
a packaging signal, a gag/pol gene comprising the sequence set forth in Figure 1, a 
5' splice site, a 3' splice site, an env gene, a tat gene, a functional RNA transport 
element and a 3' HIV or SIV LTR, said nucleic acid construct being able to produce 
functional Gag, Pol and Env virion components. 

5. A vector comprising the nucleic acid construct of Claim 1, 2, 
3 or 4. 

6. A transformed host cell comprising the nucleic acid construct 
of Claim 1, 2, 3 or 4. 

7. A transformed host cell of Claim 6 wherein said cell is a 
eukaryote. 

8. The host cell of Claim 7 wherein said cell is a human cell. 

9. A transformed host cell of Claim 6 wherein said cell is a 
prokaryote. 

10. The host cell of Claim 9 wherein said cell is E. coli. 

11. A pharmaceutical composition comprising the nucleic acid 
construct of Claim 1, 2, 3 or 4 and a pharmaceutically acceptable carrier. 

12. A method for inducing antibodies in a mammal comprising 
administering to a mammal a composition of claim 11, wherein said nucleic acid 
construct is present in an amount which is effective to induce said antibodies in said 
mammal. 

13. A method for inducing cytotoxic T lymphocytes in a mammal 
comprising administering to a mammal a composition of claim 11, wherein said 
nucleic acid construct is present in an amount which is effective to induce cytotoxic 
T lymphocytes in said mammal. 

14. A vaccine composition for inducing immunity in a mammal 
against HIV infection comprising a pharmaceutically acceptable carrier and further 
comprising a therapeutically effective amount of a nucleic acid construct of Claim 1 
capable of producing HIV Gag and Pol proteins in the absence of HIV Rev 
regulatory protein in a cell in vivo. 

15. A vaccine composition for inducing immunity in a mammal 
against HIV infection comprising a pharmaceutically acceptable carrier and further 
comprising a therapeutically effective amount of a nucleic acid construct of Claim 2 
capable of producing HIV Pol protein in the absence of HIV Rev regulatory protein 
in a cell in vivo. 

16. A vaccine composition according to claim 14 wherein said 
mammal is a human. 

17. A vaccine composition according to claim 15 wherein said 
mammal is a human. 

18. A method for inducing immunity against HIV infection in a 
mammal which comprises administering to a mammal a therapeutically effective 
amount of a vaccine composition according to claim 14. 

19. A method for inducing immunity against HIV infection in a 
mammal which comprises administering to a mammal a therapeutically effective 
amount of a vaccine composition according to claim 15. 

20. A method according to claim 18 wherein said mammal is a 
human. 

21. A method according to claim 19 wherein said mammal is a 
human. 

22. A lentiviral expression system comprising the following: 

(a) a packaging vector comprising a HIV-1 gag/pol gene having 
the nucleotide sequence set forth in Figure 1; 

(b) a transfer vector; and 

(c) an envelope encoding vector. 

23. A transformed host cell comprising the lentiviral expression 
system of Claim 22. 

24. A transformed host cell of Claim 23 wherein said cell is a 
eukaryote. 

25. The host cell of Claim 24 wherein said cell is a human cell. 

26. A process for making a lentiviral particle comprising 
expressing HIV Gag and HIV Pol in a host cell from a vector comprising the 
nucleotide sequences encoding HIV Gag and HIV Pol set forth in Figure 1 in the 
presence of a gene encoding an envelope protein. 

27. A lentiviral expression system which is capable of 
functioning in the absence of Rev, Tat, and any viral RNA transport element 
comprising the following: 

(a) a packaging vector comprising a HIV-1 gag/pol gene which 
has been mutated to eliminate inhibitory/instability regions; 

(b) a transfer vector; and 

(c) an envelope encoding vector. 

28. A transformed host cell comprising the lentiviral expression 
system of Claim 27. 

29. A transformed host cell of Claim 28 wherein said cell is a 
eukaryote. 

30. The host cell of Claim 29 wherein said cell is a human cell. 

31. A process for making a lentiviral particle in the absence of 
Rev, Tat, or any viral RNA transport element comprising expressing HIV Gag and 
HIV Pol in a host cell from a HIV-1 gag/pol gene which has been mutated to 
eliminate inhibitory/instability regions and expressing an Envelope protein from an 
envelope encoding gene whose expression is independent of Rev, Tat, or any viral 
RNA transport element. 

32. A nucleic acid construct comprising a SIV-l env gene having 
the coding sequence of the env gene set forth in Figure 16. 

33. A vector comprising the nucleic acid construct of claim 32. 

34. A transformed host cell comprising the nucleic acid construct 
of claim 32. 

35. A pharmaceutical composition comprising the nucleic acid 
construct of claim 32 and a pharmaceutically acceptable carrier. 

36. A lentiviral expression system of claim 27 wherein the HIV-1 
gag/pol gene has the coding sequence of the HIV-1 gag/pol gene set forth in 
Figure 1. 

37. The process of claim 31 wherein the HIV-1 gag/pol gene has 
the coding sequence of the HIV-1 gag/pol gene set forth in Figure 1. 

38. A lentiviral expression system of claim 27 wherein the 
packaging vector has the DNA sequence of packaging construct 
pCMVgag/polBNKan set forth in Figure 9. 

39. A lentiviral expression system of claim 27 wherein the 
transfer vector has the DNA sequence of pmBCwCNluci set forth in Figure 10 or 
pmBCmCNluci set forth in Figure 11. 
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ATGGGTGCGAGAGCGTCAGTATTAAGCGGGGGAGAATTAGATCGATGGGAAAAAAT 
TCGGTTAAGGCCAGGGGGAAAGAAGAAGTACAAGCTAAAGCACATCGTATGGGCAA 
GCAGGGAGCTAGAACGATTCGCAGTTAATCCTGGCCTGTTAGAAACATCAGAAGGC 
TGTAGACAAATACTGGGACAGCTACAACCATCCCTTCAGACAGGATCAGAGGAGCT 
TCGATCACTATACAACACAGTAGCAACCCTCTATTGTGTGCACCAGCGGATCGAGA 
TCAAGGACACCAAGGAAGCTTTAGACAAGATAGAGGAAGAGCAAAACAAGTCCAAG 
AAGAAGGCCCAGCAGGCAGCAGCTGACACAGGACACAGCAATCAGGTCAGCGAAAA 
TTACCCTATAGTGCAGAACATCCAGGGGCAAATGGTACATCAGGCCATATCACCTA 
GAACTTTAAATGCATGGGTAAAAGTAGTAGAAGAGAAGGCTTTCAGCCCAGAAGTG 
ATACCCATGTTTTCAGCATTATCAGAAGGAGCCACCCCACAGGACCTGAACACGAT 
GTTGAACACCGTGGGGGGACATCAAGCAGCCATGCAAATGTTAAAAGAGACCATCA 
ATGAGGAAGCTGCAGAATGGGATAGAGTGCATCCAGTGCATGCAGGGCCTATTGCA 
CCAGGCCAGATGAGAGAACCAAGGGGAAGTGACATAGCAGGAACTACTAGTACCCT 
TCAGGAACAAATAGGATGGATGACAAATAATCCACCTATCCCAGTAGGAGAGATCT 
ACAAGAGGTGGATAATCCTGGGATTGAACAAGATCGTGAGGATGTATAGCCCTACC 
AGCATTCTGGACATAAGACAAGGACCAAAGGAACCCTTTAGAGACTATGTAGACCG 
GTTCTATAAAACTCTAAGAGCTGAGCAAGCTTCACAGGAGGTAAAAAATTGGATGA 
CAGAAACCTTGTTGGTCCAAAATGCGAACCCAGATTGTAAGACCATCCTGAAGGCT 
CTCGGCCCAGCGGCTACACTAGAAGAAATGATGACAGCATGTCAGGGAGTAGGAGG 
ACCCGGCCATAAGGCAAGAGTTTTGGCCGAGGCGATGAGCCAGGTGACGAACTCGG 
CGACCATAATGATGCAGAGAGGCAACTTCCGGAACCAGCGGAAGATCGTCAAGTGC 
TTCAATTGTGGCAAAGAAGGGCACACCGCCAGGAACTGCCGGGCCCCCCGGAAGAA 
GGGCTGTTGGAAATGTGGAAAGGAAGGACACCAAATGAAAGATTGTACTGAGAGAC 

FIG. 1 
AGGCTAATTTTTTAGGGAAGATCTGGCCTTCCTACAAGGGAAGGCCAGGGAATTTT 
CTTCAGAGCAGACCAGAGCCAACAGCCCCACCAGAAGAGAGCTTCAGGTCTGGGGT 
AGAGACAACAACTCCCCCTCAGAAGCAGGAGCCGATAGACAAGGAACTGTATCCTT 
TAACTTCCCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAGTAAGGATCGGGG 
GGCAACTCAAGGAAGCGCTGCTCGATACAGGAGCAGATGATACAGTATTAGAAGAA 
ATGAGTTTGCCAGGAAGATGGAAACCAAAAATGATAGGGGGGATCGGGGGCTTCAT 
CAAGGTGAGGCAGTACGACCAGATACTCATAGAAATCTGTGGACATAAAGCTATAG 
GTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAGAAATCTGTTGACC 
CAGATCGGCTGCACCTTGAACTTCCCCATCAGCCCTATTGAGACGGTGCCCGTGAA 
GTTGAAGCCGGGGATGGACGGCCCCAAGGTCAAGCAATGGCCATTGACGAAAGAGA 
AGATCAAGGCCTTAGTCGAAATCTGTACAGAGATGGAGAAGGAAGGGAAGATCAGC 
AAGATCGGGCCTGAGAACCCCTACAACACTCCAGTCTTCGCAATCAAGAAGAAGGA 
CAGTACCAAGTGGAGAAAGCTGGTGGACTTCAGAGAGCTGAACAAGAGAACTCAGG 
ACTTCTGGGAAGTTCAGCTGGGCATCCCACATCCCGCTGGGTTGAAGAAGAAGAAG 
TCAGTGACAGTGCTGGATGTGGGTGATGCCTACTTCTCCGTTCCCTTGGACGAGGA 
CTTCAGGAAGTACACTGCCTTCACGATACCTAGCATCAACAACGAGACACCAGGCA 
TCCGCTACCAGTACAACGTGCTGCCACAGGGATGGAAGGGATCACCAGCCATCTTT 
CAAAGCAGCATGACCAAGATCCTGGAGCCCTTCCGCAAGCAAAACCCAGACATCGT 
GATCTATCAGTACATGGACGACCTCTACGTAGGAAGTGACCTGGAGATCGGGCAGC 
ACAGGACCAAGATCGAGGAGCTGAGACAGCATCTGTTGAGGTGGGGACTGACCACA 
CCAGACAAGAAGCACCAGAAGGAACCTCCCTTCCTGTGGATGGGCTACGAACTGCA 
TCCTGACAAGTGGACAGTGCAGCCCATCGTGCTGCCTGAGAAGGACAGCTGGACTG 
TGAACGACATACAGAAGCTCGTGGGCAAGTTGAACTGGGCAAGCCAGATCTACCCA 
GGCATCAAAGTTAGGCAGCTGTGCAAGCTGCTTCGAGGAACCAAGGCACTGACAGA 
AGTGATCCCACTGACAGAGGAAGCAGAGCTAGAACTGGCAGAGAACGGAGAGATCC 

TGAAGGAGCCAGTACATGGAGTGTACTACGACCCAAGCAAGGACCTGATCGCAGAG 
ATCCAGAAGCAGGGGCAAGGCCAATGGACCTACCAAATCTACCAGGAGCCCTTCAA 
GAACCTGAAGACAGGCAAGTACGCAAGGATGAGGGGTGCCCACACCAACGATGTGA 
AGCAGCTGACAGAGGCAGTGCAGAAGATCACCACAGAGAGCATCGTGATCTGGGGC 
AAGACTCCCAAGTTCAAGCTGCCCATACAGAAGGAGACATGGGAGACATGGTGGAC 
CGAGTACTGGCAAGCCACCTGGATCCCTGAGTGGGAGTTCGTGAACACCCCTCCCT 

TGGTGAAACTGTGGTATCAGCTGGAGAAGGAACCCATCGTGGGAGCAGAGACCTTC 
TACGTGGATGGGGCAGCCAACAGGGAGACCAAGCTGGGCAAGGCAGGCTACGTGAC 
CAACCGAGGACGACAGAAAGTGGTGACCCTGACTGACACCACCAACCAGAAGACTG 

AGCTGCAAGCCATCTACCTAGCTCTGCAAGACAGCGGACTGGAAGTGAACATCGTG 
ACAGACTCACAGTACGCACTGGGCATCATCCAAGCACAACCAGACCAATCCGAGTC 
AGAGCTGGTGAACCAGATCATCGAGCAGCTGATCAAGAAGGAGAAAGTGTACCTGG 
CATGGGTACCAGCACACAAAGGAATTGGAGGAAATGAACAAGTAGATAAATTAGTC 
AGTGCTGGGATCCGGAAGGTGCTGTTCCTGGACGGGATCGATAAGGCCCAAGATGA 
ACATGAGAAGTACCACTCCAACTGGCGCGCTATGGCCAGCGACTTCAACCTGCCAC 
CTGTAGTAGCAAAAGAAATAGTAGCCAGCTGTGATAAATGTCAGCTAAAAGGAGAA 
GCCATGCATGGACAAGTAGACTGTAGTCCAGGAATATGGCAGCTGGACTGCACGCA 
CCTGGAGGGGAAGGTGATCCTGGTAGCAGTTCATGTAGCCAGTGGATATATAGAAG 


CAGAAGTTATCCCTGCTGAAACTGGGCAGGAAACAGCATATTTTCTTTTAAAATTA 
GCAGGAAGATGGCCAGTAAAAACAATACACACGGACAACGGAAGCAACTTCACTGG 
TGCTACGGTTAAGGCCGCCTGTTGGTGGGCGGGAATCAAGCAGGAATTTGGAATTC 
CCTACAATCCCCAATCGCAAGGAGTCGTGGAGAGCATGAACAAGGAGCTGAAGAAG 
ATCATCGGACAAGTGAGGGATCAGGCTGAGCACCTGAAGACAGCAGTGCAGATGGC 
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AGTGTTCATCCACAACTTCAAAAGAAAAGGGGGGATTGGGGGGTACAGTGCAGGGG 
AAAGGATCGTGGACATCATCGCCACCGACATCCAAACCAAGGAGCTGCAGAAGCAG 
ATCACCAAGATCCAGAACTTCCGGGTGTACTACCGCGACAGCCGCAACCCACTGTG 
GAAGGGACCAGCAAAGCTCCTCTGGAAGGGAGAGGGGGCAGTGGTGATCCAGGACA 
ACAGTGACATCAAAGTGGTGCCAAGGCGCAAGGCCAAGATCATCCGCGACTATGGA 
AAACAGATGGCAGGTGATGATTGTGTGGCAAGTAGACAGGATGAGGATTAGAACCT 
GGAAGAGCCTGGTGAAGCACCATATG (SEQUENCE ID NO:1) 
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>wildtype TGTACAGAGA TGGAAAAGGA AGGGAAAATT TCAAAAATTG 
>mutated TGTACAGAGA TGGAGAAGGA AGGGAAGATC AGCAAGATCG 
#1 





>wildtype GGCCTGAAAA TCCATACAAT ACTCCAGTAT TTGCCATAAA 
>mutated GGCCTGAGAA CCCCTACAAC ACTCCAGTCT TCGCAATCAA 
#41 

>wildtype GAAAAAAGAC AGTACTAAAT GGAGAAAATT AGTAGATTTC 
>mutated GAAGAAGGAC AGTACCAAGT GGAGAAAGCT GGTGGACTTC 
#81 

>wildtype AGAGAACTTA ATAAGAGAAC TCAAGACTTC TGGGAAGTTC 
>mutated AGAGAGCTGA ACAAGAGAAC TCAGGACTTC TGGGAAGTTC 
#121 

>wildtype AATTAGGAAT ACCACATCCC GCAGGGTTAA AAAAGAAAAA 
>mutated AGCTGGGCAT CCCACATCCC GCTGGGTTGA AGAAGAAGAA 
#161 

>wildtype ATCAGTAACA GTACTGGATG TGGGTGATGC ATATTTTTCA 
>mutated GTCAGTGACA GTGCTGGATG TGGGTGATGC CTACTTCTCC 
#201 

>wildtype GTTCCCTTAG ATGAAGACTT CAGGAAATAT ACTGCATTTA 
>mutated GTTCCCTTGG ACGAGGACTT CAGGAAGTAC ACTGCCTTCA 
#241 

>wildtype CCATACCTAG TATAAACAAT GAGACACCAG GGATTAGATA 
>mutated CGATACCTAG CATCAACAAC GAGACACCAG GCATCCGCTA 
#281 

>wildtype TCAGTACAAT GTGCTTCCAC AGGGATGGAA AGGATCACCA 
>mutated CCAGTACAAC GTGCTGCCAC AGGGATGGAA GGGATCACCA 
#321 

>wildtype GCAATATTCC AAAGTAGCAT GACAAAAATC TTAGAGCCTT 
>mutated GCCATCTTTC AAAGCAGCAT GACCAAGATC CTGGAGCCCT 
#361 

>wildtype TTAGAAAACA AAATCCAGAC ATAGTTATCT ATCAATACAT 
>mutated TCCGCAAGCA AAACCCAGAC ATCGTGATCT ATCAGTACAT 
#401 

>wildtype GGATGATTTG TATGTAGGAT CTGACTTAGA AATAGGGCAG 
>mutated GGACGACCTC TACGTAGGAA GTGACCTGGA GATCGGGCAG 
#441 

>wildtype CATAGAACAA AAATAGAGGA GCTGAGACAA CATCTGTTGA 
>mutated CACAGGACCA AGATCGAGGA GCTGAGACAG CATCTGTTGA 
#481 

>wildtype GGTGGGGACT TACCACACCA GACAAAAAAC ATCAGAAAGA 
>mutated GGTGGGGACT GACCACACCA GACAAGAAGC ACCAGAAGGA 
#521 

>wildtype ACCTCCATTC CTTTGGATGG GTTATGAACT CCATCCTCAT 
>mutated ACCTCCCTTC CTGTGGATGG GCTACGAACT GCATCCTGAC 
#561 

>wildtype AAATGGACAG TACAGCCTAT AGTGCTGCCA GAAAAAGACA 
>mutated AAGTGGACAG TGCAGCCCAT CGTGCTGCCT GAGAAGGACA 
#601 

>wildtype GCTGGACTGT CAATGACATA CAGAAGTTAG TGGGGAAATT 
>mutated GCTGGACTGT GAACGACATA CAGAAGCTCG TGGGCAAGTT 
#641 

>wildtype GAATTGGGCA AGTCAGATTT ACCCAGGGAT TAAAGTAAGG 
>mutated GAACTGGGCA AGCCAGATCT ACCCAGGCAT CAAAGTTAGG 
#681 

>wildtype CAATTATGTA AACTCCTTAG AGGAACCAAA GCACTAACAG 
>mutated CAGCTGTGCA AGCTGCTTCG AGGAACCAAG GCACTGACAG 
#721 

>wildtype AAGTAATACC ACTAACAGAA GAAGCAGAGC TAGAACTGGC 
>mutated AAGTGATCCC ACTGACAGAG GAAGCAGAGC TAGAACTGGC 
#761 

>wildtype AGAAAACAGA GAGATTCTAA AAGAACCAGT ACATGGAGTG 
>mutated AGAGAACCGA GAGATCCTGA AGGAGCCAGT ACATGGAGTG 
#801 

>wildtype TATTATGACC CATCAAAAGA CTTAATACCA GAAATACAGA 
>mutated TACTACGACC CAAGCAAGGA CCTGATCGCA GAGATCCAGA 
#841 

>wildtype AGCAGGGGCA AGGCCAATGG ACATATCAAA TTTATCAAGA 
>mutated AGCAGGGGCA AGGCCAATGG ACCTACCAAA TCTACCAGGA 
#881 

>wildtype GCCATTTAAA AATCTGAAAA CAGGAAAATA TGCAAGAATG 
>mutated GCCCTTCAAG AACCTGAAGA CAGGCAAGTA CGCAAGGATG 
#921 

>wildtype AGGGGTGCCC ACACTAATGA TGTAAAACAA TTAACAGAGG 
>mutated AGGGGTGCCC ACACCAACGA TGTGAAGCAG CTGACAGAGG 
#961 

>wildtype CAGTGCAAAA AATAACCACA GAAAGCATAG TAATATGGGG 
>mutated CAGTGCAGAA GATCACCACA GAGAGCATCG TGATCTGGGG 
#1001 

>wildtype AAAGACTCCT AAATTTAAAC TGCCCATACA AAAGGAAACA 
>mutated CAAGACTCCC AAGTTCAAGC TGCCCATACA GAAGGAGACA 
#1041 

>wildtype TGGGAAACAT GGTGGACAGA GTATTGGCAA GCCACCTGGA 
>mutated TGGGAGACAT GGTGGACCGA GTACTGGCAA GCCACCTGGA 
#1081 

>wildtype TTCCTGAGTG GGAGTTTGTT AATACCCCTC CTTTAGTGAA 
>mutated TCCCTGAGTG GGAGTTCGTG AACACCCCTC CCTTGGTGAA 
#1121 

>wildtype ATTATGGTAC CAGTTAGAGA AAGAACCCAT AGTAGGAGCA 
>mutated ACTGTGGTAT CAGCTGGAGA AGGAACCCAT CGTGGGAGCA 
#1161 

>wildtype GAAACCTTCT ATGTAGATGG GGCAGCTAAC AGGGAGACTA 
>mutated GAGACCTTCT ACGTGGATGG GGCAGCCAAC AGGGAGACCA 
#1201 

>wildtype AATTAGGAAA AGCAGGATAT GTTACTAATA GAGGAAGACA 
>mutated AGCTGGGCAA GGCAGGCTAC GTGACCAACC GAGGACGACA 
#1241 

>wildtype AAAAGTTGTC ACCCTAACTG ACACAACAAA TCAGAAGACT 
>mutated GAAAGTGGTG ACCCTGACTG ACACCACCAA CCAGAAGACT 
#1281 

>wildtype GAGTTACAAG CAATTTATCT AGCTTTGCAG GATTCGGGAT 
>mutated GAGCTGCAAG CCATCTACCT AGCTCTGCAA GACAGCGGAC 
#1321 

>wildtype TAGAAGTAAA CATAGTAACA GACTCACAAT ATGCATTAGG 
>mutated TGGAAGTGAA CATCGTGACA GACTCACAGT ACGCACTGGG 
#1361 

>wildtype AATCATTCAA GCACAACCAG ATCAAAGTGA ATCAGAGTTA 
>mutated CATCATCCAA GCACAACCAG ACCAATCCGA GTCAGAGCTG 
#1401 

>wildtype GTCAATCAAA TAATAGAGCA GTTAATAAAA AAGGAAAAGG 
>mutated GTGAACCAGA TCATCGAGCA GCTGATCAAG AAGGAGAAAG 
#1441 

>wildtype TCTATCTGGC ATGGGTACCA GCACACAAAG GAATTGGAGG 
>mutated TGTACCTGGC ATGGGTACCA GCACACAAAG GAATTGGAGG 
#1481 

>wildtype AAATGAACAA GTAGATAAAT TAGTCAGTGC TGGAATCAGG 
>mutated AAATGAACAA GTAGATAAAT TAGTCAGTGC TGGGATCCGG 
#1521 

>wildtype AAAGTACTAT TTTTAGATGG AATAGATAAG GCCCAAGATG 
>mutated AAGGTGCTGT TCCTGGACGG GATCGATAAG GCCCAAGATG 
#1561 

>wildtype AACATGAGAA ATATCACAGT AATTGGAGAG CAATGGCTAG 
>mutated AACATGAGAA GTACCACTCC AACTGGCGCG CTATGGCCAG 
#1601 

>wildtype TGATTTTAAC CTGCCACCTG TAGTAGCAAA AGAAATAGTA 
>mutated CGACTTCAAC CTGCCACCTG TAGTAGCAAA AGAAATAGTA 
#1641 

>wildtype GCCAGCTGTG ATAAATGTCA GCTAAAAGGA GAAGCCATGC 
>mutated GCCAGCTGTG ATAAATGTCA GCTAAAAGGA GAAGCCATGC 
#1681 

>wildtype ATGGACAAGT AGACTGTAGT CCAGGAATAT GGCAACTAGA 
>mutated ATGGACAAGT AGACTGTAGT CCAGGAATAT GGCAGCTGGA 
#1721 

>wildtype TTGTACACAT TTAGAAGGAA AAGTTATCCT GGTAGCAGTT 
>mutated CTGCACGCAC CTGGAGGGGA AGGTGATCCT GGTAGCAGTT 
#1761 

>wildtype CATGTAGCCA GTGGATATAT AGAAGCAGAA GTTATTCCAG 
>mutated CATGTAGCCA GTGGATATAT AGAAGCAGAA GTTATCCCTG 
#1801 

>wildtype CAGAAACAGG GCAGGAAACA GCATATTTTC TTTTAAAATT 
>mutated CTGAAACTGG GCAGGAAACA GCATATTTTC TTTTAAAATT 
#1841 

>wildtype AGCAGGAAGA TGGCCAGTAA AAACAATACA TACAGACAAT 
>mutated AGCAGGAAGA TGGCCAGTAA AAACAATACA CACGGACAAC 
#1881 

>wildtype GGCAGCAATT TCACCAGTGC TACGGTTAAG GCCGCCTGTT 
>mutated GGAAGCAACT TCACTGGTGC TACGGTTAAG GCCGCCTGTT 
#1921 

>wildtype GGTGGGCGGG AATCAAGCAG GAATTTGGAA TTCCCTACAA 
>mutated GGTGGGCGGG AATCAAGCAG GAATTTGGAA TTCCCTACAA 
#1961 

>wildtype TCCCCAAAGT CAAGGAGTAG TAGAATCTAT GAATAAAGAA 
>mutated TCCCCAATCG CAAGGAGTCG TGGAGAGCAT GAACAAGGAG 
#2001 

>wildtype TTAAAGAAAA TTATAGGACA GGTAAGAGAT CAGGCTGAAC 
>mutated CTGAAGAAGA TCATCGGACA AGTGAGGGAT CAGGCTGAGC 
#2041 

>wildtype ATCTTAAGAC AGCAGTACAA ATGGCAGTAT TCATCCACAA 
>mutated ACCTGAAGAC AGCAGTGCAG ATGGCAGTGT TCATCCACAA 
#2081 

>wildtype TTTTAAAAGA AAAGGGGGGA TTGGGGGGTA CAGTGCAGGG 
>mutated CTTCAAAAGA AAAGGGGGGA TTGGGGGGTA CAGTGCAGGG 
#2121 

>wildtype GAAAGAATAG TAGACATAAT AGCAACAGAC ATACAAACTA 
>mutated GAAAGGATCG TGGACATCAT CGCCACCGAC ATCCAAACCA 
#2161 

>wildtype AAGAATTACA AAAACAAATT ACAAAAATTC AAAATTTTCG 
>mutated AGGAGCTGCA GAAGCAGATC ACCAAGATCC AGAACTTCCG 
#2201 

>wildtype GGTTTATTAC AGGGACAGCA GAAATCCACT TTGGAAAGGA 
>mutated GGTGTACTAC CGCGACAGCC GCAACCCACT GTGGAAGGGA 
#2241 

>wildtype CCAGCAAAGC TCCTCTGGAA AGGTGAAGGG GCAGTAGTAA 
>mutated CCAGCAAAGC TCCTCTGGAA GGGAGAGGGG GCAGTGGTGA 
#2281 

>wildtype TACAAGATAA TAGTGACATA AAAGTAGTGC CAAGAAGAAA 
>mutated TCCAGGACAA CAGTGACATC AAAGTGGTGC CAAGGCGCAA 
#2321 

>wildtype AGCAAAGATC ATTAGGGATT ATGGAAAACA GATGGCAGGT 
>mutated GGCCAAGATC ATCCGCGACT ATGGAAAACA GATGGCAGGT 
#2361 

>wildtype GATGATTGTG TGGCAAGTAG ACAGGATGAG GATTAGAACA 
>mutated GATGATTGTG TGGCAAGTAG ACAGGATGAG GATTAGAACC 
#2401 

>wildtype TGGAAAAGTT TAGTAAAACA CCATATG 
>mutated TGGAAGAGCC TGGTGAAGCA CCATATG 
#2441 

FIG. 2 
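
The alignment in FIG. 2 can be checked mechanically. The following is a minimal sketch, not part of the original disclosure, that translates block #1 of the wildtype and mutated rows and counts the nucleotide differences. It assumes that the first base of block #1 is the first base of a codon; this reading frame is an assumption made for illustration, chosen because both rows then translate identically. The function names `translate` and `count_differences` are hypothetical helpers.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Drop the spacing used in the figure and normalise case."""
    return seq.replace(" ", "").upper()


def translate(seq):
    """Translate in frame 0, ignoring any trailing incomplete codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def count_differences(seq_a, seq_b):
    """Number of positions at which two equal-length sequences differ."""
    a, b = clean(seq_a), clean(seq_b)
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


# Block #1 of FIG. 2, copied from the alignment.
wildtype = "TGTACAGAGA TGGAAAAGGA AGGGAAAATT TCAAAAATTG"
mutated = "TGTACAGAGA TGGAGAAGGA AGGGAAGATC AGCAAGATCG"

print(translate(wildtype), translate(mutated), count_differences(wildtype, mutated))
```

Under the stated frame assumption the two rows encode the same peptide even though several bases differ, which is what one would expect if the changes in this stretch are silent.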
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atgggcgtgagaaactccgtcttgtcagggaagaaagcagatgaattag 

aaaaaattaggctacgacccaacggaaagaaaaagtacatgttgaagc 

atgtagtatgggcagcaaatgaattagatagatttggattagcagaaag 

cctgttggagaacaaagaaggatgtcaaaaaatactttcggtcttagct 

ccattagtgccaacaggctcagaaaatttaaaaagcctttataatactg 

tctgcgtcatctggtgcattcacgcagaagagaaagtgaaacacactga 

ggaagcaaaacagatagtgcagagacacctagtggtggaaacaggaac 

caccgaaaccatgccgaagacctctcgaccaacagcaccatctagcggc 

agaggaggaaactacccagtacagcagatcggtggcaactacgtccac 

ctgccactgtccccgagaaccctgaacgcttgggtcaagctgatcgagg 

agaagaagttcggagcagaagtagtgccaggattccaggcactgtcag 

aaggttgcaccccctacgacatcaaccagatgctgaactgcgttggaga 

ccatcaggcggctatgcagatcatccgtgacatcatcaacgaggaggct 

gcagattgggacttgcagcacccacaaccagctccacaacaaggacaa 

cttagggagccgtcaggatcagacatcgcaggaaccacctcctcagttg 

acgaacagatccagtggatgtaccgtcagcagaacccgatcccagtagg 

caacatctaccgtcgatggatccagctgggtctgcagaaatgcgtccgt 

atgtacaacccgaccaacattctagatgtaaaacaagggccaaaagag 

ccatttcagagctatgtagacaggttctacaaaagtttaagagcagaac 

agacagatgcagcagtaaagaattggatgactcaaacactgctgattca 

aaatgctaacccagattgcaagctagtgctgaaggggctgggtgtgaat 

cccaccctagaagaaatgctgacggcttgtcaaggagtaggggggccg 

ggacagaaggctagattaatggcagaagccctgaaagaggccctcgca 

ccagtgccaatcccttttgcagcagcccaacagaggggaccaagaaagc 

caattaagtgttggaattgtgggaaagagggacactctgcaaggcaatg 

cagagccccaagaagacagggatgctggaaatgtggaaaaatggacca 

tgttatggccaaatgcccagacagacaggcgggttttttaggccttggt 

ccatggggaaagaagccccgcaatttccccatggctcaagtgcatcagg 

ggctgatgccaactgctcccccagaggacccagctgtggatctgctaaa 

gaactacatgcagttgggcaagcagcagagagaaaagcagagagaaag 

cagagagaagccttacaaggaggtgacagaggatttgctgcacctcaat 

tctctctttggaggagaccagtag 
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SIV gag — 

#1 

ATGGGCGTGAGAAACTCCGTCTTGTCAGGGAAGAAAGCAG 

SIV gag 

#41 

ATGAATTAGAAAAAATTAGGCTACGACCCAACGGAAAGAA 

SIV gag 

#81 

AAAGTACATGTTGAAGCATGTAGTATGGGCAGCAAATGAA 

SIV gag 

#121 

TTAGATAGATTTGGATTAGCAGAAAGCCTGTTGGAGAACA 

SIV gag 

#161 

AAGAAGGATGTCAAAAAATACTTTCGGTCTTAGCTCCATT 

SIV gag 

#201 

AGTGCCAACAGGCTCAGAAAATTTAAAAAGCCTTTATAAT 

SIV gag 

#241 

ACTGTCTGCGTCATCTGGTGCATTCACGCAGAAGAGAAAG 

SIV gag 

SIVgagDX.. 

#281 

TGAAACACACTGAGGAAGCAAAACAGATAGTGCAGAGACA 

SIV gag A — A T A — A 

SIVgagDX. . C — C C G — G 

#321 

CCTAGTGGTGGAAACAGGAACMACMGAAACYATGCCRAAR 

SIV gag — AAG-A 

SIVgagDX. . — CTC-C 

#361 

ACMWSTMGACCAACAGCACCATCTAGCGGCAGAGGAGGAA 
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SIV gag -T A — A — A T 

S IVgagDX . . -C G — G — C C 

#401 

AYTACCCAGTACARCARATMGGTGGTAACTAYGTCCACCT 

SIV gag T-AAG AT -A — T — C A — AT — 

S IVgagDX. . C-GTC CC-G — C — T C — GC — 

#441 

GCCAYTRWSCCCGAGAACMYTRAAYGCYTGGGTMAARYTG 

SIV gag — A A A— T 

S IVgagDX. . — C G G — C 

#481 

ATMGAGGARAAGAARTTYGGAGCAGAAGTAGTGCCAGGAT 

SIV gag -T T T 

SIVgagDX . . -C C C 

#521 

TYCAGGCACTGTCAGAAGGTTGCACCCCCTAYGACATYAA 

SIV gag T T-A — T — T — G A 

SIVgagDX . . C c-G — C — C — T G 

#561 

YCAGATGYTRAAYTGYGTKGGAGACCATCARGCGGCTATG 

SIV gag T A- A — T — T — A 

SIVgagDX. . C C-T — C — C — C 

#601 

CAGATYATCMGWGAYATYATMAACGAGGAGGCTGCAGATT 

SIV gag 

SIVgagDX. . 

#641 

GGGACTTGCAGCACCCACAACCAGCTCCACAACAAGGACA 

SIV gag T — T A — T 

SIVgagDX. . C — C -C — C 

#681 

ACTTAGGGAGCCGTCAGGATCAGAYATYGCAGGAACMACY 

SIV gag AGT A— T A A-A— A- 

S IVgagDX. . TCC T — C G C-T — G- 

#721 

WSYTCAGTWGAYGAACARATCCAGTGGATGTACMGWCARC 
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SIV gag C — A T A-GA 

SIVgagDX. . G — C C C-TC 

#761 

AGAACCCSATMCCAGTAGGCAACATYTACMGKMGATGGAT 

SIV gag A GT A — A — T — CA-A T A 

S IVgagDX . . G TC — ■ — G — G — C — TC-T C G 

#801 

CCARCTGGGKYTGCARAARTGYGTYMGWATGTAYAACCCR 

SIV gag — A 

SIVgagDX . . — C 

#841 

ACMAACATTCTAGATGTAAAACAAGGGCCAAAAGAGCCAT 

SIV gag 

#881 

TTCAGAGCTATGTAGACAGGTTCTACAAAAGTTTAAGAGC 

SIV gag 

#921 

AGAACAGACAGATGCAGCAGTAAAGAATTGGATGACTCAA 

SIV gag 

#961 

ACACTGCTGATTCAAAATGCTAACCCAGATTGCAAGCTAG 

SIV gag 

#1001 

TGCTGAAGGGGCTGGGTGTGAATCCCACCCTAGAAGAAAT 

SIV gag 

#1041 

GCTGACGGCTTGTCAAGGAGTAGGGGGGCCGGGACAGAAG 

SIV gag 

#1081 

GCTAGATTAATGGCAGAAGCCCTGAAAGAGGCCCTCGCAC 

SIV gag 

#1121 

CAGTGCCAATCCCTTTTGCAGCAGCCCAACAGAGGGGACC 

SIV gag 

#1161 

AAGAAAGCCAATTAAGTGTTGGAATTGTGGGAAAGAGGGA 
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SIV gag 

#1201 

CACTCTGCAAGGCAATGCAGAGCCCCAAGAAGACACGGAT 

SIV gag 

#1241 

GCTGGAAATGTGGAAAAATGGACCATGTTATGGCCAAATG 

SIV gag 

#1281 

CCCAGACAGACAGGCGGGTTTTTTAGGCCTTGGTCCATGG 

SIV gag 

#1321 

GGAAAGAAGCCCCGCAATTTCCCCATGGCTCAAGTGCATC 

SIV gag 

#1361 

AGGGGCTGATGCCAACTGCTCCCCCAGAGGACCCAGCTGT 

SIV gag 

#1401 

GGATCTGCTAAAGAACTACATGCAGTTGGGCAAGCAGCAG 

SIV gag 

#1441 

AGAGAAAAGCAGAGAGAAAGCAGAGAGAAGCCTTACAAGG 

SIV gag 

#1481 

AGGTGACAGAGGATTTGCTGCACCTCAATTCTCTCTTTGG 

SIV gag 

#1521 

AGGAGACCAGTAG 
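
The combined rows in the SIV gag alignment above use single-letter ambiguity codes (M, R, Y, W, S, K) at positions where the SIV gag and SIVgagDX rows differ. The sketch below, which is not part of the original disclosure, shows how such a combined row can be derived from two aligned sequences under the standard IUPAC two-base codes. The two input strings are reconstructed here from the end of the #321 block and should be read as an illustrative assumption; `consensus` is a hypothetical helper name.

```python
# IUPAC codes for single bases and for every pair of distinct bases.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C",
    frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y",
    frozenset("CG"): "S", frozenset("AT"): "W",
    frozenset("GT"): "K", frozenset("AC"): "M",
}


def consensus(seq_a, seq_b):
    """Merge two aligned sequences into one row of IUPAC codes."""
    a = seq_a.replace(" ", "").upper()
    b = seq_b.replace(" ", "").upper()
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return "".join(IUPAC[frozenset((x, y))] for x, y in zip(a, b))


# Last 22 positions of the #321 block, as read from the difference rows.
siv_gag = "AACAACAGAAACTATGCCAAAA"
siv_gag_dx = "AACCACCGAAACCATGCCGAAG"

print(consensus(siv_gag, siv_gag_dx))
```

Reading the figure in the other direction, an M in the combined row means A in one sequence and C in the other, R means A or G, and Y means C or T.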
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1 


CCTGGCCATT 


GCATACGTTG 


TATCCATATC 


ATAATATGTA 


CATTTATATT 


GGCTCATGTC CAACATTACC 


71 


GCCATGTTGA 


CATTGATTAT 


TGACTAGTTA 


TTAATAGTAA 


TCAATTACGG 


GGTCATTAGT 


TCATAGCCCA 


141 

X M X 


TATATC£2AGT 
mini xjvjrvva x 




A I AACTTACG 


GTAAATGGCC 


CGCCTGGCTG 


ACCGCCCAAC 


GACCCCCGCC 


211 


CATTGACGTC 


AATAATCACn 


TATGTTCCCA 


TAGTAACGCC 


AATAGGGACT 


TTCCATTCAC 


GTCAATGGGT 


281 


GGAGTATTTA 


CGGTAAACTG 


CCCACTTGGC 


AGTACATCAA 


kj 1 oTATCATA 


TGCCAAGTAC 


GCCCCCTATT 


351 


GACGTCAATG 


ACGGTAAATG 


GCCCGCCTGG 


CATTATGCCC 


ArTA C A Tr A f 

A\j x AUAXGAC 


CTTATGGGAC 


TTTCCTACTT 


421 


GGCAGTACAT 


CTACGTATTA 


GTCATCGCTA 


TTACCATGGT 


bATGuGGTTT 


TGGCAGTACA 


TCAATGGGCG 


491 


TGGATAGCGG 


TTTGACTCAC 


GGGGATTTCC 


AAGTCTCCAC 


CCCATTGACG 


TCAATGGGAG 


TTTGTTTTGG 


561 


CACCAAAATC 


AACGGGACTT 


TCCAAAATGT 


CGTAACAACT 


CCGCCCCATT 


GACGCAAATG 


GGCGGTAGGC 


631 


GTGTACGGTG 


GGAGGTCTAT 


ATAAGCAGAG 


CTCGTTTAGT 


GAACCGTCAG 


ATCGCCTGGA 


GACGCCATCC 


701 


ACGCTGTTTT 


GACCTCCATA 


GAAGACACCG 


GGACCGATCC 


AGCCTCCGCG 


Sail (758) 
GGCGCGCGTC GACAGAGAGA 


771 


TGGGTGCGAG 


AGCGTCAGTA 


TTAAGCGGGG GAGAATTAGA 


TCGATGGGAA 


AAAATTCGGT 


TAAGGCCAGG 


841 


GGGAAAGAAG 


AAGTACAAGC 


TAAAGCACAT 


CGTATGGGCA 


AGCAGGGAGC 


TAGAACGATT 


CGCAGTTAAT 


911 


CCTGGCCTGT 


TAGAAACATC 


AGAAGGCTGT 


AGACAAATAC 


TGGGACAGCT 


ACAACCATCC 


CTTCAGACAG 


981 


GATCAGAGGA 


GCTTCGATCA 


CTATACAACA 


CAGTAGCAAC 


CCTCTATTGT 


GTGCACCAGC 


GGATCGAGAT 


1051 


CAAGGACACC 


AAGGAAGCTT 


TAGACAAGAT 


AGAGGAAGAG 


CAAAACAAGT 


CCAAGAAGAA 


GGCCCAGCAG 


1121 


GCAGCAGCTG 


ACACAGGACA 


CAGCAATCAG 


GTCAGCCAAA 


ATTACCCTAT 


AGTGCAGAAC 


ATCCAGGGGC 


1191 


AAATGGTACA 


TCAGGCCATA 


TCACCTAGAA 


CTTTAAATGC 


ATGGGTAAAA 


GTAGTAGAAG 


AGAAGGCTTT 


1261 


CAGCCCAGAA 


GTGATACCCA 


TGTTTTCAGC 


ATTATCACAA GGAGCCACCC 


CACAGGACCT 


GAACACGATG 


1331 


TTGAACACCG 


TGGGGGGACA 


TCAAGCAGCC 


ATGCAAATGT 


TAAAAGAGAC 


CATCAATGAG 


GAAGCTGCAG 


1401 


AATGGGATAG 


AGTGCATCCA 


GTGCATGCAG 


GGCCTATTGC 


ACCAGGCCAG 


ATGAGAGAAC 


CAAGGGGAAG 


1471 


TGACATAGCA 


GGAACTACTA 


GTACCCTTCA 


GGAACAAATA 


GGATGGATGA 


CAAATAATCC 


ACCTATCCCA 


1541 


GTAGGAGAGA 


TCTACAAGAG 


GTGGATAATC 


CTGGGATTGA 


ACAAGATCGT 


GAGGATGTAT 


AGCCCTACCA 


1611 


GCATTCTGGA 


CATAAGACAA 


GGACCAAAGG 


AACCCTTTAG 


AGACTATGTA 


GACCGGTTCT 


ATAAAACTCT 


1681 


AAGAGCTGAG 


CAAGCTTCAC 


AGGAGGTAAA 


AAATTGGATG 


ACAGAAACCT 


TGTTGGTCCA 


AAATGCGAAC 


1751 


CCAGATTGTA 


AGACCATCCT 


GAAGGCTCTC 


GGCCCAGGGG 


CTACACTAGA 


AGAAATGATG 


ACAGCATGTC 


1821 


AGGGAGTAGG 


AGGACCCGGC 


CATAAGGCAA 


GAGTTTTGGC 


CGAGGCGATG 


AGCCAGGTGA 


CGAACTCGGC 
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1891 


GACCATAATG ATGCAGAGAG GCAACTTCCG GAACCAGCGG 


AAGATCGTCA AGTGCTTCAA TTGTGGCAAA 


1961 


GAAGGGCACA 


CCGCCAGGAA 


CTGCCGGGCC 


CCCCGGAAGA 


AGGGCTGTTG 


GAAATGTGGA AAGGAAGGAC 


2031 


ACCAAATGAA AGATTGTACT GAGAGACAGG CTAATTTTTT 


AGGGAAGATC 


TGGCCTTCCT ACAAGGGAAG 


2101 


GCCAGGGAAT TTTCTTCAGA GCAGACCAGA GCCAACAGCC 


CCACCAGAAG 


AGAGCTTCAG 


GTCTGGGGTA 


2171 


GAGACAACAA 


CTCCCCCTCA 


GAAGCAGGAG 


CCGATAGACA AGGAACTGTA TCCTTTAACT 


TCCCTCAGAT 


2241 


CACTCTTTGG 


CAACGACCCC 


TCGTCACAGT 


AAGGATCGGG 


GGGCAACTCA 


AGGAAGCGCT 


GCTCGATACA 


2311 


GGAGCAGATG 


ATACAGTATT 


AGAAGAAATG 


AGTTTGCCAG 


GAAGATGGAA 


ACCAAAAATG 


ATAGGGGGGA 


2381 


TCGGGGGCTT 


CATCAAGGTG 


AGGCAGTACG 


ACCAGATACT 


CATAGAAATC 


TGTGGACATA AAGCTATAGG 


2451 


TACAGTATTA 


GTAGGACCTA 


CACCTGTCAA 


CATAATTGGA 


AGAAATCTGT TGACCCAGAT CGGCTGCACC 


2521 


TTGAACTTCC 


CCATCAGCCC 


TATTGAGACG 


GTGCCCGTGA 


AGTTGAAGCC 


GGGGATGGAC 


GGCCCCAAGG 


2591 


TCAAGCAATG 


GCCATTGACG 


AAAGAGAAGA 


TCAAGGCCTT 


AGTCGAAATC 


TGTACAGAGA TGGAGAAGGA 


2661 


AGGGAAGATC 


AGCAAGATCG 


GGCCTGAGAA 


CCCCTACAAC 


ACTCCAGTCT 


TCGCAATCAA GAAGAAGGAC 


2731 


AGTACCAAGT 


GGAGAAAGCT 


GGTGGACTTC 


AGAGAGCTGA 


ACAAGAGAAC 


TCAGGACTTC 


TGGGAAGTTC 


2801 


AGCTGGGCAT 


CCCACATGCC 


GCTGGGTTGA 


AGAAGAAGAA 


GTCAGTGACA 


GTGCTGGATG 


TGGGTGATGC 


2871 


CTACTTCTCC 


GTTCCCTTGG 


ACGAGGACTT 


CAGGAAGTAC 


ACTGCCTTCA 


CGATACCTAG 


CATCAACAAC 


2941 


GAGACACCAG 


GCATCCGCTA 


CCAGTACAAC 


GTGCTGCCAC 


AGGGATGGAA 


GGGATCACCA 


GCCATCTTTC 


3011 


AAAGCAGCAT 


GACCAAGATC 


CTGGAGCCCT 


TCCGCAAGCA 


AAACCCAGAC 


ATCGTGATCT 


ATCAGTACAT 


3081 


GGACGACCTC 


TACGTAGGAA 


GTGACCTGGA 


GATCGGGCAG 


CACAGGACCA 


AGATCGAGGA 


GCTGAGACAG 


3151 


CATCTGTTGA 


GGTGGGGACT 


GACCACACCA 


GACAAGAAGC 


ACCAGAAGGA 


ACCTCCCTTC 


CTGTGGATGG 


3221 


GCTACGAACT 


GCATCCTGAC 


AAGTGGACAG 


TGCAGCCCAT 


CGTGCTGCCT 


GAGAAGGACA 


GCTGGACTGT 


3291 


GAACGACATA 


CAGAAGCTCG 


TGGGCAAGTT 


GAACTGGGCA 


AGCCAGATCT 


ACCCAGGCAT 


CAAAGTTAGG 


3361 


CAGCTGTGCA 


AGCTGCTTCG 


AGGAACCAAG 


GCACTGACAG 


AAGTGATCCC 


ACTGACAGAG 


GAAGCAGAGC 


3431 


TAGAACTGGC 


AGAGAACCGA 


GAGATCCTGA 


AGGAGCCAGT 


ACATGGAGTG 


TACTACGACC 


CAAGCAAGGA 


3501 


CCTGATCGCA 


GAGATCCAGA AGCAGGGGCA 


AGGCCAATGG 


ACCTACCAAA 


TCTACCAGGA 


GCCCTTCAAG 


3571 


AACCTGAAGA 


CAGGCAAGTA 


CGCAAGGATG 


AGGGGTGCCC 


ACACCAACGA 


TGTGAAGCAG 


CTGACAGAGG 


3641 


CAGTGCAGAA 


GATCACCACA GAGAGCATCG 


TGATCTGGGG 


CAAGACTCCC 


AAGTTCAAGC 


TGCCCATACA 


3711 


GAAGGAGACA 


TGGGAGACAT 


GGTGGACCGA GTACTGGCAA 


GCCACCTGGA 


TCCCTGAGTG 


GGAGTTCCTG 
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3781 


AACACCCCTC 


CCTTGGTGAA ACTGTGGTAT CAGCTGGAGA AGGAACCCAT 


CGTGGGAGCA GAGACCTTCT 


3851 


ACGTGGATGG 


GGCAGCCAAC 


AGGGAGACCA 


AGCTGGGCAA 


GGCAGGCTAC 


GTGACCAACC GAGGACGACA 


3921 


GAAAGTGGTG 


ACCCTGACTG 


ACACCACCAA 


CCAGAAGACT 


GAGCTGCAAG 


CCATCTACCT AGCTCTGCAA 


3991 


GACAGCGGAC 


TGGAAGTGAA 


CATCGTGACA 


GACTCACAGT 


ACGCACTGGG 


CATCATCCAA GCACAACCAG 


4061 


ACCAATCCGA 


GTCAGAGCTG 


GTGAACCAGA 


TCATCGAGCA 


GCTGATCAAG 


AAGGAGAAAG TGTACCTGGC 


4131 


ATGGGTACCA 


GCACACAAAG 


GAATTGGAGG 


AAATGAACAA GTAGATAAAT 


TAGTCAGTGC TGGGATCCGG 


4201 


AAGGTGCTGT 


TCCTGGACGG 


GATCGATAAG 


GCCCAAGATG 


AACATGAGAA 


GTACCACTCC AACTGGCGCG 


4271 


CTATGGCCAG 


CGACTTCAAC 


CTGCCACCTG 


TAGTAGCAAA 


AGAAATAGTA 


GCCAGCTGTG ATAAATGTCA 


4341 


GCTAAAAGGA 


GAAGCCATGC 


ATGGACAAGT 


AGACTGTAGT 


CCAGGAATAT 


GGCAGCTCGA CTGCACGCAC 


4411 


CTGGAGGGGA 


AGGTGATCCT 


GGTAGCAGTT 


CATGTAGCCA 


GTGGATATAT 


AGAAGCAGAA GTTATCCCTG 


4481 


CTGAAACTGG 


GCAGGAAACA 


GCATATTTTC 


TTTTAAAATT 


AGCAGGAAGA 


TGGCCAGTAA AAACAATACA 


4551 


CACCGACAAC 


GGAAGCAACT 


TCACTGGTGC 


TACGGTTAAG 


GCCGCCTGTT 


GGTGGGCGGG AATCAAGCAG 


4621 


GAATTTGGAA 


TTCCCTACAA 


TCCCCAATCG 


CAAGGAGTCG 


TGGAGAGCAT 


GAACAAGGAG CTGAAGAAGA 


4691 


TCATCGGACA 


AGTGAGGGAT 


CAGGCTGAGC 


ACCTGAAGAC 


AGCAGTGCAG 


ATGGCAGTGT TCATCCACAA 


4761 


CTTCAAAAGA 


AAAGGGGGGA 


TTGGGGGGTA 


CAGTGCAGGG 


GAAAGGATCG 


TGGACATCAT CGCCACCGAC 


4831 


ATCCAAACCA 


AGGAGCTGCA 


GAAGCAGATC 


ACCAAGATCC 


AGAACTTCCG 


GGTGTACTAC CGCGACAGCC 


4901 


GCAACCCACT 


GTGGAAGGGA 


CCAGCAAAGC 


TCCTCTGGAA 


GGGAGAGGGG 


GCAGTGGTGA TCCAGGACAA 


4971 


CAGTGACATC 


AAAGTGGTGC 


CAAGGCGCAA 


GGCCAAGATC 


ATCCGCGACT ATGGAAAACA GATGGCAGGT 


5041 


GATGATTGTG 


TGGCAAGTAG 


ACAGGATGAG 


GATTAGAACC 


TGGAAGAGCC 


TGGTGAAGCA CCATATGGCG 


5111 


Nhel (5117) 
BstBI (5111) 
TTCGAAGCTA GCCTCGAGAT 


CCAGATCTGC 


TGTGCCTTCT 


AGTTGCCAGC 


CATCTGTTGT TTGCCCCTCC 


5181 


CCCGTGCCTT 


CCTTGACCCT 


GGAAGGTGCC 


ACTCCCACTG 


TCCTTTCCTA ATAAAATGAG GAAATTGCAT 


5251 


CGCATTGTCT 


GAGTAGGTGT 


CATTCTATTC 


TGGGGGGTGG 


GGTGGGGCAG 


CACAGCAAGG GGGAGGATTG 


5321 


GGAAGACAAT 


AGCAGGCATG 


CTGGGGATGC 


GGTGGGCTCT 


ATGCGTACCC 


AGGTGCTGAA GAATTGACCC 


5391 


GGTTCCTCCT 


GGGCCAGAAA 


GAAGCAGGCA 


CATCCCCTTC 


TCTCTGACAC 


ACCCTGTCCA CGCCCCTGGT 


5461 


TCTTAGTTCC 


AGCCCCACTC 


ATAGGACACT 


CATAGCTCAG 


GAGGGCTCCG 


CCTTCAATCC CACCCGCTAA 


5531 


AGTACTTGGA 


GCGGTCTCTC 


CCTCCCTCAT 


CAGCCCACCA 


AACCAAACCT 


AGCCTCCAAG AGTGGGAAGA 
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5601 AATTAAAGCA AGATAGGCTA TTAAGTGCAG AGGGAGAGAA AATCCCTCCA ACATGTGAGG AAGTAATGAG 



5671 AGAAATCATA GAATTTCTTC CGCTTCCTCG CTCACTGACT CGCTGCGCTC GGTCGTTCGG CTCCGGCGAG 



5741 CGGTATCAGC TCACTCAAAG GCGGTAATAC GGTTATCCAC AGAATCAGGG GATAACGCAG GAAAGAACAT 
5811 GTGAGCAAAA GGCCAGCAAA AGGCCAGGAA CCGTAAAAAG GCCGCGTTGC TGGCGTTTTT CCATAGGCTC 
5881 CGCCCCCCTG ACGAGCATCA CAAAAATCGA CGCTCAAGTC AGAGGTGGCG AAACCCGACA GGACTATAAA 
5951 GATACCAGGC GTTTCCCCCT GGAAGCTCCC TCGTGCGCTC TCCTGTTCCG ACCCTGCCGC TTACCGGATA 
6021 CCTGTCCGCC TTTCTCCCTT CGGGAAGCGT GGCGCTTTCT CAATGCTCAC GCTGTAGGTA TCTCACTTCG 
6091 GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA GCCCGACCGC TGCGCCTTAT 
6161 CCGGTAACTA TCGTCTTGAG TCCAACCCGG TAAGACACGA CTTATCGCCA CTGGCAGCAG CCACTGGTAA 
6231 CAGGATTAGC AGAGCGAGGT ATGTAGGCGG TGCTACAGAG TTCTTGAAGT GGTGGCCTAA CTACGGCTAC 
6301 ACTAGAAGGA CAGTATTTGG TATCTGCGCT CTGCTGAAGC CAGTTACCTT CGGAAAAAGA GTTGGTAGCT 
6371 CTTGATCCGG CAAACAAACC ACCGCTGGTA GCGGTGGTTT TTTTGTTTGC AAGCAGCAGA TTACGCGCAG 
6441 AAAAAAAGGA TCTCAAGAAG ATCCTTTGAT CTTTTCTACG GGGTCTGACG CTCAGTGGAA CGAAAACTCA 
6511 CGTTAAGGGA TTTTGGTCAT GAGATTATCA AAAAGGATCT TCACCTAGAT CCTTTTAAAT TAAAAATGAA 
6581 GTTTTAAATC AATCTAAAGT ATATATGAGT AAACTTGGTC TGACAGTTAC CAATGCTTAA TCAGTGAGGC 
6651 ACCTATCTCA GCGATCTGTC TATTTCGTTC ATCCATAGTT GCCTGACTCC GGGGGGGGGG GGCGCTGAGG 
6721 TCTGCCTCGT GAAGAAGGTG TTGCTGACTC ATACCAGGCC TGAATCGCCC CATCATCCAG CCAGAAAGTG 
6791 AGGGAGCCAC GGTTGATGAG AGCTTTGTTG TAGGTGGACC AGTTGGTGAT TTTGAACTTT TCCTTTGCCA 
6861 CGGAACGGTC TGCGTTGTCG GGAAGATGCG TGATCTGATC CTTCAACTCA GCAAAAGTTC GATTTATTCA 
6931 ACAAAGCCGC CGTCCCGTCA AGTCAGCGTA ATGCTCTGCC AGTGTTACAA CCAATTAACC AATTCTGATT 
7001 AGAAAAACTC ATCGAGCATC AAATGAAACT GCAATTTATT CATATCAGGA TTATCAATAC CATATTTTTG 
271 « PhePheGlu AspLeuMetL euHisPheGl nLeuLysAsn MetAspProA snAspIleGl yTyrLysGln 
7071 AAAAAGCCGT TTCTGTAATG AAGGAGAAAA CTCACCGAGG CAGTTCCATA GGATGGCAAG ATCCTGGTAT 

248 « PheLeuArgL ysGlnLeuSe rProSerPhe GluGlyLeuC ysAsnTrpLe uIleAlaLeu AspGlnTyrA 
7141 CGGTCTGCGA TTCCGACTCG TCCAACATCA ATACAACCTA TTAATTTCCC CTCGTCAAAA ATAAGGTTAT 

224 « rgAspAlaIl eGlyValArg GlyValAspI leCysGlyIl eLeuLysGly GluAspPheI leLeuAsnAs 
7211 CAAGTGAGAA ATCACCATGA GTGACGACTG AATCCGGTGA GAATGGCAAA AGCTTATGCA TTTCTTTCCA 

201 « pLeuSerPhe AspGlyHisT hrValValSe rAspProSer PheProLeuL euLysHisMe tGluLysTrp 
7281 GACTTGTTCA ACAGGCCAGC CATTACGCTC GTCATCAAAA TCACTCGCAT CAACCAAACC GTTATTCATT 

178 « ValGlnGluV alProTrpGl yAsnArgGlu AspAspPheA spSerAlaAs pValLeuGly AsnAsnMetA 
7351 CGTGATTGCG CCTGAGCGAG ACGAAATACG CGATCGCTGT TAAAAGGACA ATTACAAACA GGAATCGAAT 

154 « rgSerGlnAl aGlnAlaLeu ArgPheValA rgAspSerAs nPheProCys AsnCysValP roIleSerHi 
7421 GCAACCGGCG CAGGAACACT GCCAGCGCAT CAACAATATT TTCACCTGAA TCAGGATATT CTTCTAATAC 

131 « sLeuArgArg LeuPheValA laLeuAlaAs pValIleAsn GluGlySerA spProTyrGl uGluLeuVal 
7491 CTGGAATGCT GTTTTCCCGG GGATCGCAGT GGTGAGTAAC CATGCATCAT CAGGAGTACG GATAAAATGC 

108 « GlnPheAlaT hrLysGlyPr oIleAlaThr ThrLeuLeuT rpAlaAspAs pProThrArg IlePheHisL 
7561 TTGATGGTCG GAAGAGGCAT AAATTCCGTC AGCCAGTTTA GTCTGACCAT CTCATCTGTA ACATCATTGG 
84 « ysIleThrPr oLeuProMet PheGluThrL euTrpAsnLe uArgValMet GluAspThrV alAspAsnAl 
7631 CAACGCTACC TTTGCCATGT TTCAGAAACA ACTCTGGCGC ATCGGGCTTC CCATACAATC GATAGATTGT 
61 « aValSerGly LysGlyHisL ysLeuPheLe uGluProAla AspProLysG lyTyrLeuAr gTyrlleThr 
7701 CGCACCTGAT TGCCCGACAT TATCGCGAGC CCATTTATAC CCATATAAAT CAGCATCCAT GTTGGAATTT 
38 « AlaGlySerG InGlyValAs nAspArgAla TrpLysTyrG lyTyrLeuAs pAlaAspMet AsnSerAsnL 
7771 AATCGCGGCC TCGAGCAAGA CGTTTCCCGT TGAATATGGC TCATAACACC CCTTGTATTA CTGTTTATGT 

14 4 euArgProAr gSerCysSer ThrGluArgG InlleHisSe rMet 
7841 AAGCAGACAG TTTTATTGTT CATGATGATA TATTTTTATC TTGTGCAATG TAACATCAGA GATTTTGAGA 
7911 CACAACGTGG CTTTCCCCCC CCCCCCATTA TTGAAGCATT TATCAGGGTT ATTGTCTCAT GAGCGGATAC 
7981 ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC CCCGCACATT TCCCCGAAAA GTGCCACCTG 
8051 ACGTCTAAGA AACCATTATT ATCATGACAT TAACCTATAA AAATAGGCGT ATCACGAGGC CCTTTCGTCT 
8121 CGCGCGTTTC GGTGATGACG GTGAAAACCT CTGACACATG CAGCTCCCGG AGACGGTCAC AGCTTGTCTG 
8191 TAAGCGGATG CCGGGAGCAG ACAAGCCCGT CAGGGCGCGT CAGCGGGTGT TGGCGGGTGT CGGGGCTGGC
8261 TTAACTATGC GGCATCAGAG CAGATTGTAC GTAGAGTGCA CCATATGCGG TGTGAAATAC CGCACAGATG 
8331 CGTAAGGAGA AAATACCGCA TCAGATTGGC TATTGG 
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1 

X 


TfiPAAPPPPT 






/UibAtjA X i 


1 (j/vlt/xljxL»G 


ATCTACCACA 


CACAAGGCTA 




CTTCCCTGAT 


TGGPAGAAPT 


AP AP APP A HP 


PPP APPP A TP 


APATATPPAP 


TGACCTTTGG ATGGTGCTTC 


141 

i T i. 


AAP.TTAPT A P 


C AHTTRA ATP 


APAPP A APT A 


PA AP A POOP A 


A ATA APP AP A 


GAAGAACAGC 


TTGTTACACC 


211 CTATGAGCCA GCATGGGATG GAGGAGCCGG AGGGAGAAGT ATTAGTGTGG AAGTTTGACA GCCTGCTAGC

281 ATTTCGTCAC ATGGCCCGAG AGCTGCATCC GGAGTACTAC AAAGACTGCT GACATCGAGC TTTCTACAAG

351 GGACTTTCCG CTGGGGACTT TCCAGGGAGG TGTGGCCTGG GCGGGACTGG GGAGTGGCGA GCCCTCAGAT

421 GCTACATATA AGCAGCTGCT TTTTGCCTGT ACTGGGTCTC TCTGGTTAGA CCAGATCTGA GCCTGGGAGC

491 TCTCTGGCTA ACTAGGGAAC CCACTGCTTA AGCCTCAATA AAGCTTGCCT TGAGTGCTCA AAGTAGTGTG

561 TGCCCGTCTG TTGTGTGACT CTGGTAACTA GAGATCCCTC AGACCCTTTT AGTCAGTGTG GAAAATCTCT

631 AGCAGTGGCG CCCGAACAGG GACTTGAAAG CGAAAGTAAA GCCAGAGGAG ATCTCTCGAC GCAGGACTCG

BssHII (711)
701 GCTTGCTGAA GCGCGCacgg caagaggcga ggggcggcgC ctgACgagGa cgccaaaaac cttgactagc

ClaI (830)
771 ggaggccaga aggagagagC TCGGTGCGAG AGCGTCAGTA TCAAGCGGGG GAGAATTAGA TCGATGGGAA

841 AAAATTCGGT TAAGGCCAGG GGGAAAGAAA AAATATAAAT TAAAACATAT AGTATGGGCA AGCAGGGAGC

AccI (959)
911 TAGAACGATT CGCAGTTAAT CCTGGCCTGT TAGAAACATC AGAAGGCTGT AGACAAATAC TGGGACAGCT

981 ACAACCATCC CTTCAGACAG GATCAGAAGA ACTTAGATCA TTATATAATA CAGTAGCAAC CCTCTATTGT

1051 GTGCATCAAA GGATAGAGAT AAAAGACACC AAGGAAGCTT TAGACAAGAT AGAGGAAGAG CAAAACAAAA

FIG. 10
1121 GTAAGAAAAA AGCACAGCAA GCAGCAGCTG ACACAGGACA CAGCAATCAG GTCAGCCAAA ATTACCCTAT

1191 AGTGCAGAAC ATCCAGGGGC AAATGGTACA TCAGGCCATA TCACCTAGAA CTTTAAACGA TAAGCTTGGG

1261 AGTTCCGCGT TACATAACTT ACGGTAAATG GCCCGCCTGG CTGACCGCCC AACGACCCCC GCCCATTGAC

1331 GTCAATAATG ACGTATGTTC CCATAGTAAC GCCAATAGGG ACTTTCCATT GACGTCAATG GGTGGAGTAT

1401 TTACGGTAAA CTGCCCACTT GGCAGTACAT CAAGTGTATC ATATGCCAAG TACGCCCCGT ATTGACGTCA

1471 ATGACGGTAA ATGGCCCGCC TGGCATTATG CCCAGTACAT GACCTTATGG GACTTTCCTA CTTGGCAGTA

1541 CATCTACGTA TTAGTCATCG CTATTACCAT GGTGATGCGG TTTTGGCAGT ACATCAATGG GCGTGGATAG

1611 CGGTTTGACT CACGGGGATT TCCAAGTCTC CACCCCATTG ACGTCAATGG GAGTTTGTTT TGGCACCAAA

1681 ATCAACGGGA CTTTCCAAAA TGTCGTAACA ACTCCGCCCC ATTGACGCAA ATGGGCGGTA GGCGTGTACG

1751 GTGGGAGGTC TATATAAGCA GAGCTCGTTT AGTGAACCGT CAGATCGCCT GGAGACGCCA TCCACGCTGT

1821 TTTGACCTCC ATAGAAGACA CCGACTCTAG AGgatccATC TAAGTAAGCT TGGCATTCCG GTACTGTTGG

1891 TAAAATGGAA GACGCCAAAA ACATAAAGAA AGGCCCGGCG CCATTCTATC CTCTAGAGGA TGGAAGCGCT

1961 GGAGAGCAAC TGCATAAGGC TATGAAGAGA TACGCCCTGG TTCCTGGAAC AATTGCTTTT ACAGATGCAC

2031 ATATCGAGGT GAACATCACG TACGCGGAAT ACTTCGAAAT GTCCGTTCGC TTGGCAGAAG CTATGAAACG

2101 ATATGGGCTG AATACAAATC ACAGAATCGT CGTATGCAGT GAAAACTCTC TTCAATTCTT TATGCCCGTG

2171 TTGGGCCCGT TATTTATCGG AGTTGCAGTT GCGCCCGCGA ACGACATTTA TAATGAACGT GAATTGCTCA

2241 ACAGTATGAA CATTTCGCAG CCTACCGTAG TGTTTGTTTC CAAAAAGGGG TTGCAAAAAA TTTTGAACGT

2311 GCAAAAAAAA TTACCAATAA TCCAGAAAAT TATTATCATG GATTCTAAAA CGGATTACCA GGGATTTCAG
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2381 TCGATGTACA CGTTCGTCAC ATCTCATCTA CCTCCCGGTT TTAATGAATA CGATTTTCTA CCAGAGTCCT 



2451 TTGATCGTGA CAAAAGAATT GCACTGATAA TGAATTCCTC TGGATCTACT GGGTTACCTA AGGGTGTGGC

2521 CCTTCCGCAT AGAACTCCCT GCGTCAGATT CTCGCATGCC AGAGATCCTA TTTTTGGCAA TCAAATCATT

2591 CCGGATACTG CGATTTTAAG TGTTGTTCCA TTCCATCACG GTTTTGGAAT GTTTACTACA CTCGGATATT

2661 TGATATGTGG ATTTCGAGTC GTCTTAATGT ATAGATTTGA AGAAGAGCTG TTTTTACGAT CCCTTCAGGA

2731 TTACAAAATT CAAAGTGCGT TGCTAGTACC AACCCTATTT TCATTCTTCG CCAAAAGCAC TCTGATTGAC

2801 AAATACGATT TATCTAATTT ACACGAAATT GCTTCTGGGG GCGCACCTCT TTCGAAAGAA GTCCGGGAAG

2871 CGGTTGCAAA ACGCTTCCAT CTTCCAGGGA TACGACAAGG ATATGGGCTC ACTGAGACTA CATCAGCTAT

2941 TCTGATTACA CCCGAGGGGG ATGATAAACC GGGCGCGGTC GGTAAAGTTG TTCCATTTTT TGAAGCGAAG

3011 GTTGTGGATC TGGATACCGG GAAAACGCTG GGCGTTAATC AGAGAGGCGA ATTATGTGTC AGAGGACCTA

3081 TGATTATGTC CGGTTATGTA AACAATCCGG AAGCGACCAA CGCCTTGATT GACAAGGATG GATGGCTACA

3151 TTGTGGAGAC ATAGCTTACT GGGACGAAGA CGAACACTTC TTCATAGTTG ACCGCTTGAA GTCTTTAATT

ClaI (3259)
3221 AAATACAAAG GATATCAGGT GGCCCCCGCT GAATTGGAAT CGATATTGTT ACAACACCCC AACATCTTCG

3291 ACGCGGGCGT GGCAGGTCTT CCCGACGATG ACGCCGGTGA ACTTCCCGCC GCCGTTGTTG TTTTGGAGCA

3361 CGGAAAGACG ATGACGGAAA AAGAGATCGT GGATTACGTC GCCAGTCAAG TAACAACCGC GAAAAAGTTG

3431 CGCGGAGGAG TTGTGTTTGT GGACGAAGTA CCGAAAGGTC TTACCGGAAA ACTCGACCCA AGAAAAATCA

XhoI (3548) ApaI (3557) KpnI (3563)
3501 GAGAGATCCT CATAAAGGCC AAGAAGGGCG GAAAGTCCAA ATTCTAAcTC GAGGGGGGGC CCGGTACCTT
3571 TAAGACCAAT GACTTACAAG GCAGCTGTAG ATCTTAGCCA CTTTTTAAAA GAAAAGGGGG GACTGGAAGG

3641 GCTAATTCAC TCCCAAAGAA GACAAGATAT CCTTGATCTG TGGATCTACC ACACACAAGG CTACTTCCCT

3711 GATTGGCAGA ACTACACACC AGGGCCAGGG GTCAGATATC CACTGACCTT TGGATGGTGC TACAAGCTAG

3781 TACCAGTTGA GCCAGATAAG GTAGAAGAGG CCAATAAAGG AGAGAACACC AGCTTGTTAC ACCCTGTGAG

3851 CCTGCATGGA ATGGATGACC CTGAGAGAGA AGTGTTAGAG TGGAGGTTTG ACAGCCGCCT AGCATTTCAT

3921 CACGTGGCCC GAGAGCTGCA TCCGGAGTAC TTCAAGAACT GCTGACATCG AGCTTGCTAC AAGGGACTTT

3991 CCGCTGGGGA CTTTCCAGGG AGGCGTGGCC TGGGCGGGAC TGGGGAGTGG CGAGCCCTCA GATGCTGCAT

4061 ATAAGCAGCT GCTTTTTGCC TGTACTGGGT CTCTCTGGTT AGACCAGATC TGAGCCTGGG AGCTCTCTGG

4131 CTAACTAGGG AACCCACTGC TTAAGCCTCA ATAAAGCTTG CCTTGAGTGC TTCAAGTAGT GTGTGCCCGT

4201 CTGTTGTGTG ACTCTGGTAA CTAGAGATCC CTCAGACCCT TTTAGTCAGT GTGGAAAATC TCTAGCACCC

4271 CCCAGGAGGT AGAGGTTGCA GTGAGCCAAG ATCGCGCCAC TGCATTCCAG CCTGGGCAAG AAAACAAGAC 

4341 TGTCTAAAAT AATAATAATA AGTTAAGGGT ATTAAATATA TTTATACATG GAGGTCATAA AAATATATAT 

4411 ATTTGGGCTG GGCGCAGTGG CTCACACCTG CGCCCGGCCC TTTGGGAGGC CGAGGCAGGT GGATCACCTG 

4481 AGTTTGGGAG TTCCAGACCA GCCTGACCAA CATGGAGAAA CCCCTTCTCT GTGTATTTTT ATGAGATTTT

4551 ATTTTATGTG TATTTTATTC ACAGGTATTT CTGGAAAACT GAAACTGTTT TTCCTCTACT CTGATACCAC 

4621 AAGAATCATC AGCACAGAGG AAGACTTCTG TGATCAAATG TGGTGGGAGA GGGAGGTTTT CACCAGCACA 

4691 TGAGCAGTCA GTTCTGCCGC AGACTCGGCG GGTGTCCTTC GGTTCAGTTC CAACACCGCC TGCCTCGAGA 

4761 GAGGTCAGAC CACAGGGTGA GGGCTCAGTC CCCAAGACAT AAACACCCAA GACATAAACA CCCAACAGGT 

4831 CCACCCCGCC TGCTGCCCAG GCAGAGCCGA TTCACCAAGA CGGGAATTAG GATAGAGAAA GAGTAAGTCA 

4901 CACAGAGCCG GCTGTGCGGG AGAACGGAGT TCTATTATGA CTCAAATCAG TCTCCCCAAG CATTCGGGGA 

4971 TCAGAGTTTT TAAGGATAAC TTAGTGTGTA GGGGGCCAGT GAGTTGGAGA TGAAAGCGTA GGGAGTCGAA 

5041 GGTGTCCTTT TGCGCCGAGT CAGTTCCTGG GTGGGGGCCA CAAGATCGGA TGAGCCAGTT TATCAATCCG 

5111 GGGGTGCCAG CTGATCCATG GAGTGCAGGG TCTGCAAAAT ATCTCAAGCA CTGATTGATC TTAGGTTTTA 

5181 CAATAGTGAT GTTACCCCAG GAACAATTTG GGGAAGGTCA GAATCTTGTA GCCTGTAGCT GCATGACTCC 

5251 TAAACCATAA TTTCTTTTTT GTTTTTTTTT TTTTATTTTT GAGACAGGGT CTCACTCTGT CACCTAGGCT 

5321 GGAGTGCAGT GGTGCAATCA CAGCTCACTG CAGCCCCTAG AGCGGCCGCC ACCGCGGTGG AGCTCCAATT 

5391 CGCCCTATAG TGAGTCGTAT TACAATTCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG 

5461 CGTTACCCAA CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA AGAGGCCCGC 

5531 ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGCGCGA AATTGTAAAC GTTAATATTT 

5601 TGTTAAAATT CGCGTTAAAT TTTTGTTAAA TCAGCTCATT TTTTAACCAA TAGGCCGAAA TCGGCAAAAT 

5671 CCCTTATAAA TCAAAAGAAT AGACCGAGAT AGGGTTGAGT GTTGTTCCAG TTTGGAACAA GAGTCCACTA 

5741 TTAAAGAACG TGGACTCCAA CGTCAAAGGG CGAAAAACCG TCTATCAGGG CGATGGCCCA CTACGTGAAC 

5811 CATCACCCTA ATCAAGTTTT TTGGGGTCGA CGTGCCGTAA AGCACTAAAT CGGAACCCTA AAGGGACCCC 

5881 CCGATTTAGA GCTTGACGGG GAAAGCCGGC GAACGTGGCG AGAAAGGAAG GGAAGAAAGC GAAAGGAGCG 
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5951 GCCGCTAGGG CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTAATGCGC 

6021 CGCTACAGGG CGCGTCCCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC 

6091 TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA 

6161 GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT 

6231 TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC 

6301 ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA ACAACGTTTT CCAATGATGA 

6371 GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG 

6441 CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC

6511 ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA 

6581 CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA 

6651 TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG 

6721 GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT 

6791 GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCCCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA 

6861 TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC 

6931 CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA 

7001 TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT 

7071 AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ACTTCATGAC CAAAATCCCT 

7141 TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT

7211 TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA 

7281 TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT 

7351 CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTCCTAA 

7421 TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT 

7491 ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC 

7561 TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG 

7631 ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG 

7701 GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG 

7771 GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG 

7841 CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG AGTGAGCTGA 

7911 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCCAATA 

7981 CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA CGACAGGTTT CCCGACTGGA 

8051 AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT CACTCATTAG GCACCCCAGG CTTTACACTT 

8121 TATGCTTCCG GCTCGTATGT TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC 

8191 CATGATTACG CCAAGCTCGG AATTAACCCT CACTAAAGGG AACAAAAGCT GCTGCAGGGT CCCTAACTGC 

8261 CAAGCCCCAC AGTGTGCCCT GAGGCTGCCC CTTCCTTCTA GCGGCTGCCC CCACTCGGCT TTGCTTTCCC 

8331 TAGTTTCAGT TACTTGCGTT CAGCCAAGGT CTGAAACTAG GTGCGCACAG AGCGGTAAGA CTGCGAGAGA 

8401 AAGAGACCAG CTTTACAGGG GGTTTATCAC AGTGCACCCT GACAGTCGTC AGCCTCACAG GGGGTTTATC 

8471 ACATTGCACC CTGACAGTCG TCAGCCTCAC AGGGGGTTTA TCACAGTGCA CCCTTACAAT CATTCCATTT 

8541 GATTCACAAT TTTTTTAGTC TCTACTGTGC CTAACTTGTA AGTTAAATTT GATCAGAGGT GTGTTCCCAG 

8611 AGGGGAAAAC AGTATATACA GGGTTCAGTA CTATCGCATT TCAGGCCTCC ACCTGGGTCT TGGAATGTGT

8681 CCCCCGAGGG GTGATGACTA CCTCAGTTGG ATCTCCACAG GTCACAGTGA CACAAGATAA CCAAGACACC 

8751 TCCCAAGGCT ACCACAATGG GCCGCCCTCC ACGTGCACAT GGCCGGAGGA ACTGCCATGT CGGAGGTGCA 

8821 AGCACACCTG CGCATCAGAG TCCTTGGTGT GGAGGGAGGG ACCAGCGCAG CTTCCAGCCA TCCACCTGAT 

8891 GAACAGAACC TAGGGAAAGC CCCAGTTCTA CTTACACCAG GAAAGGC 
AGCAGTGGCG CCCGAACAGG GACTTGAAAG CGAAAGTAAA GCCAGAGGAG ATCTCTCGAC GCAGGACTCG

701

ClaI (830)
GAGAATTAGA TCGATGGGAA

841 AAAATTCGGT TAAGGCCAGG GGGAAAGAAG AAGTACAAGC TAAAGCACAT CGTATGGGCA AGCAGGGAGC

AccI (959)
911 TAGAACGATT CGCAGTTAAT CCTGGCCTGT TAGAAACATC AGAAGGCTGT AGACAAATAC TGGGACAGCT

981 ACAACCATCC CTTCAGACAG GATCAGAGGA GCTTCGATCA CTATACAACA CAGTAGCAAC CCTCTATTGT

1051 GTGCACCAGC GGATCGAGAT CAAGGACACC AAGGAAGCTT TAGACAAGAT AGAGGAAGAG CAAAACAAGT

1121 CCAAGAAGAA GGCCCAGCAG GCAGCAGCTG ACACAGGACA CAGCAATCAG GTCAGCCAAA ATTACCCTAT

FIG. 11



WO 02/099101 



PCT/US02/17258 



30/45 



1191 AGTGCAGAAC ATCCAGGGGC AAATGGTACA TCAGGCCATA TCACCTAGAA CTTTAAACGA TAAGCTTGGG

1261 AGTTCCGCGT TACATAACTT ACGGTAAATG GCCCGCCTGG CTGACCGCCC AACGACCCCC GCCCATTGAC

1331 GTCAATAATG ACGTATGTTC CCATAGTAAC GCCAATAGGG ACTTTCCATT GACGTCAATG GGTGCAGTAT

1401 TTACGGTAAA CTGCCCACTT GGCAGTACAT CAAGTGTATC ATATGCCAAG TACGCCCCCT ATTGACCTCA

1471 ATGACGGTAA ATGGCCCGCC TGGCATTATG CCCAGTACAT GACCTTATGG GACTTTCCTA CTTGGCAGTA

1541 CATCTACGTA TTAGTCATCG CTATTACCAT GGTGATGCGG TTTTGGCAGT ACATCAATGG GCGTGGATAG

1611 CGGTTTGACT CACGGGGATT TCCAAGTCTC CACCCCATTG ACGTCAATGG GAGTTTGTTT TGGCACCAAA

1681 ATCAACGGGA CTTTCCAAAA TGTCGTAACA ACTCCGCCCC ATTGACGCAA ATGGGCGGTA GGCGTGTACG

1751 GTGGGAGGTC TATATAAGCA GAGCTCGTTT AGTGAACCGT CAGATCGCCT GGAGACGCCA TCCACGCTGT

1821 TTTGACCTCC ATAGAAGACA CCGACTCTAG AGgatccATC TAAGTAAGCT TGGCATTCCG GTACTGTTGG

1891 TAAAATGGAA GACGCCAAAA ACATAAAGAA AGGCCCGGCG CCATTCTATC CTCTAGAGGA TGGAACCGCT

1961 GGAGAGCAAC TGCATAAGGC TATGAAGAGA TACGCCCTGG TTCCTGGAAC AATTGCTTTT ACAGATGCAC

2031 ATATCGAGGT GAACATCACG TACGCGGAAT ACTTCGAAAT GTCCGTTCGG TTGGCAGAAG CTATGAAACG

2101 ATATGGGCTG AATACAAATC ACAGAATCGT CGTATGCAGT GAAAACTCTC TTCAATTCTT TATGCCGGTG

2171 TTGGGCGCGT TATTTATCGG AGTTGCAGTT GCGCCCGCGA ACGACATTTA TAATGAACGT GAATTGCTCA

2241 ACAGTATGAA CATTTCGCAG CCTACCGTAG TGTTTGTTTC CAAAAAGGGG TTGCAAAAAA TTTTGAACGT

2311 GCAAAAAAAA TTACCAATAA TCCAGAAAAT TATTATCATG GATTCTAAAA CGGATTACCA GCGATTTCAG

2381 TCGATGTACA CGTTCGTCAC ATCTCATCTA CCTCCCGGTT TTAATGAATA CGATTTTGTA CCAGAGTCCT
2451 TTGATCGTGA CAAAACAATT GCACTGATAA TGAATTCCTC TGGATCTACT GGGTTACCTA AGGGTGTGGC

2521 CCTTCCGCAT AGAACTGCCT GCGTCAGATT CTCGCATGCC AGAGATCCTA TTTTTGGCAA TCAAATCATT

2591 CCGGATACTG CGATTTTAAG TGTTGTTCCA TTCCATCACG GTTTTGGAAT GTTTACTACA CTCGGATATT

2661 TGATATGTGG ATTTCGAGTC GTCTTAATGT ATAGATTTGA AGAAGAGCTG TTTTTACGAT CCCTTCAGGA

2731 TTACAAAATT CAAAGTGCGT TGCTAGTACC AACCCTATTT TCATTCTTCG CCAAAAGCAG TCTGATTGAC

2801 AAATACGATT TATCTAATTT ACACGAAATT GCTTCTGGGG GCGCACCTCT TTCGAAAGAA GTCGGGGAAG

2871 CGGTTGCAAA ACGCTTCCAT CTTCCAGGGA TACGACAAGG ATATGGGCTC ACTGAGACTA CATCAGCTAT

2941 TCTGATTACA CCCGAGGGGG ATGATAAACC GGGCGCGGTC GGTAAAGTTG TTCCATTTTT TGAAGCGAAG

3011 GTTGTGGATC TGGATACCGG GAAAACGCTG GGCGTTAATC AGAGAGGCGA ATTATGTGTC AGAGGACCTA

3081 TGATTATGTC CGGTTATGTA AACAATCCGG AAGCGACCAA CGCCTTGATT GACAAGGATG GATGGCTACA

3151 TTCTGGAGAC ATAGCTTACT GGGACGAAGA CGAACACTTC TTCATAGTTG ACCGCTTGAA GTCTTTAATT

ClaI (3259)
3221 AAATACAAAG GATATCAGGT GGCCCCCGCT GAATTGGAAT CGATATTGTT ACAACACCCC AACATCTTCG

3291 ACGCGGGCGT GGCAGGTCTT CCCGACGATG ACGCCGGTGA ACTTCCCGCC GCCGTTGTTG TTTTGGAGCA

3361 CGGAAAGACG ATGACGGAAA AAGAGATCGT GGATTACGTC GCCAGTCAAG TAACAACCGC GAAAAAGTTG

3431 CGCGGAGGAG TTGTGTTTGT GGACGAAGTA CCGAAAGGTC TTACCGGAAA ACTCGACGCA AGAAAAATCA

XhoI (3548) ApaI (3557) KpnI (3563)
3501 GAGAGATCCT CATAAAGGCC AAGAAGGGCG GAAAGTCCAA ATTGTAAcTC GAGGGGGGGC CCGGTACCTT

3571 TAAGACCAAT GACTTACAAG GCAGCTGTAG ATCTTAGCCA CTCTTTAAAA GAAAAGGGGG GACTGGAAGG
3641 GCTAATTCAC TCCCAAAGAA GACAAGATAT CCTTGATCTG TGGATCTACC ACACACAAGG CTACTTCCCT

3711 GATTGGCAGA ACTACACACC AGGGCCAGGG GTCAGATATC CACTGACCTT TGGATGGTGC TACAAGCTAG

3781 TACCAGTTGA GCCAGATAAG GTAGAAGAGG CCAATAAAGG AGAGAACACC AGCTTGTTAC ACCCTGTGAG

3851 CCTGCATGGA ATGGATGACC CTGAGAGAGA AGTGTTAGAG TGGAGGTTTG ACAGCCGCCT AGCATTTCAT

3921 CACGTGGCCC GAGAGCTGCA TCCGGAGTAC TTCAAGAACT GCTGACATCG AGCTTGCTAC AAGGGACTTT

3991 CCGCTGGGGA CTTTCCAGGG AGGCGTGGCC TGGGCGGGAC TGGGGAGTGG CGAGCCCTCA GATGCTGCAT

4061 ATAAGCAGCT GCTTTTTGCC TGTACTGGGT CTCTCTGGTT AGACCAGATC TGAGCCTGGG AGCTCTCTGG

4131 CTAACTAGGG AACCCACTGC TTAAGCCTCA ATAAAGCTTG CCTTGAGTGC TTCAAGTAGT GTGTGCCCGT

4201 CTGTTGTGTG ACTCTGGTAA CTAGAGATCC CTCAGACCCT TTTAGTCAGT GTGGAAAATC TCTAGCACCC

4271 CCCAGGAGGT AGAGGTTGCA GTGAGCCAAG ATCGCGCCAC TGCATTCCAG CCTGGGCAAG AAAACAAGAC

4341 TGTCTAAAAT AATAATAATA AGTTAAGGGT ATTAAATATA TTTATACATG GAGGTCATAA AAATATATAT 

4411 ATTTGGGCTG GGCGCAGTGG CTCACACCTG CGCCCGGCCC TTTGGGAGGC CGAGGCAGGT GGATCACCTG 

4481 AGTTTGGGAG TTCCAGACCA GCCTGACCAA CATGGAGAAA CCCCTTCTCT GTGTATTTTT AGTAGATTTT 

4551 ATTTTATGTG TATTTTATTC ACAGGTATTT CTGGAAAACT GAAACTGTTT TTCCTCTACT CTGATACCAC

4621 AAGAATCATC AGCACAGAGG AAGACTTCTG TGATCAAATG TGGTGGGAGA GGGAGGTTTT CACCAGCACA 

4691 TGAGCAGTCA GTTCTGCCGC AGACTCGGCG GGTGTCCTTC GGTTCAGTTC CAACACCGCC TGCCTGGAGA 

4761 GAGGTCAGAC CACAGGGTGA GGGCTCAGTC CCCAAGACAT AAACACCCAA GACATAAACA CGCAACAGGT

4831 CCACCCCGCC TGCTGCCCAG GCAGAGCCGA TTCACCAAGA CGGGAATTAG GATAGAGAAA GAGTAAGTCA 

4901 CACAGAGCCG GCTGTGCGGG AGAACGGAGT TCTATTATGA CTCAAATCAG TCTCCCCAAG CATTCGGGGA 

4971 TCAGAGTTTT TAAGGATAAC TTAGTGTGTA GGGGGCCAGT GAGTTGGAGA TGAAAGCGTA GGGAGTCGAA 

5041 GCTGTCCTTT TGCGCGGAGT CAGTTCCTGG GTGGGGGCCA CAAGATCGGA TGAGCCAGTT TATCAATCCG

5111 GGGGTGCCAG CTGATCCATG GAGTGCAGGG TCTGCAAAAT ATCTCAAGCA CTGATTGATC TTAGGTTTTA 

5181 CAATAGTGAT GTTACCCCAG GAACAATTTG GGGAAGGTCA GAATCTTGTA GCCTGTAGCT GCATGACTCC 

5251 TAAACCATAA TTTCTTTTTT GTTTTTTTTT TTTTATTTTT GAGACAGGGT CTCACTCTGT CACCTAGGCT 

5321 GGAGTGCAGT GGTGCAATCA CAGCTCACTG CAGCCCCTAG AGCGGCCGCC ACCGCGGTGG AGCTCCAATT 

5391 CGCCCTATAG TGAGTCGTAT TACAATTCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG 

5461 CGTTACCCAA CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA AGAGGCCCGC 

5531 ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGCGCGA AATTGTAAAC GTTAATATTT 

5601 TGTTAAAATT CGCGTTAAAT TTTTGTTAAA TCAGCTCATT TTTTAACCAA TAGGCCGAAA TCGGCAAAAT 

5671 CCCTTATAAA TCAAAAGAAT AGACCGAGAT AGGGTTGAGT GTTCTTCCAG TTTGGAACAA GAGTCCACTA 

5741 TTAAAGAACG TGGACTCCAA CGTCAAAGGG CGAAAAACCG TCTATCAGGG CGATGGCCCA CTACGTGAAC 

5811 CATCACCCTA ATCAAGTTTT TTGGGGTCGA GGTGCCGTAA AGCACTAAAT CGGAACCCTA AAGGGAGCCC 

5881 CCGATTTAGA GCTTGACGGG GAAAGCCGGC GAACGTGGCG AGAAAGGAAG GGAAGAAAGC GAAAGGAGCG 

5951 GGCGCTAGGG CGCTGGCAAG TGTAGCGGTC ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTAATGCGC 

6021 CGCTACAGGG CGCGTCCCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC 

6091 TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA 
6161 GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT

6231 TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG AGTGGGTTAC

6301 ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA

6371 GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC GGGCAAGAGC AACTCGGTCG

6441 CCGCATACAC TATTCTCAGA ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC

6511 ATGACAGTAA GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA

6581 CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA

6651 TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG

6721 GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC CCGGCAACAA TTAATAGACT

6791 GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA

6861 TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC

6931 CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA

7001 TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT

7071 AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC CAAAATCCCT

7141 TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT

7211 TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA

7281 TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT

7351 CTAGTGTAGC CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA

7421 TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT

7491 ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAAGGACC

7561 TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA GCGCCACGCT TCCCGAAGGG AGAAAGGCGG

7631 ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG

7701 GTATCTTTAT AGTCCTGTCG GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG

7771 GGGCGGAGCC TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG

7841 CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAAGCGTATT ACCGCCTTTG AGTGAGCTGA

7911 TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA GCGCCCAATA

7981 CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA CGACAGGTTT CCCGACTGGA

8051 AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT CACTCATTAG GCACCCCAGG CTTTACACTT

8121 TATGCTTCCG GCTCGTATGT TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC

8191 CATGATTACG CCAAGCTCGG AATTAACCCT CACTAAAGGG AACAAAAGCT GCTGCAGGGT CCCTAACTGC

8261 CAAGCCCCAC AGTGTGCCCT GAGGCTGCCC CTTCCTTCTA GCGGCTGCCC CCACTCGGCT TTGCTTTCCC

8331 TAGTTTCAGT TACTTGCGTT CAGCCAAGGT CTGAAACTAG GTGCGCACAG AGCGGTAAGA CTGCGAGAGA

8401 AAGAGACCAG CTTTACAGGG GGTTTATCAC AGTGCACCCT GACAGTCGTC AGCCTCACAG GGGGTTTATC

8471 ACATTGCACC CTGACAGTCG TCAGCCTCAC AGGGGGTTTA TCACAGTGCA CCCTTACAAT CATTCCATTT

8541 GATTCACAAT TTTTTTAGTC TCTACTGTGC CTAACTTGTA AGTTAAATTT GATCAGAGGT GTGTTCCCAG

8611 AGGGGAAAAC AGTATATACA GGGTTCAGTA CTATCGCATT TCAGGCCTCC ACCTGGGTCT TGGAATGTGT

8681 CCCCCGAGGG GTGATGACTA CCTCAGTTGG ATCTCCACAG GTCACAGTGA CACAAGATAA CCAAGACACC

8751 TCCCAAGGCT ACCACAATGG GCCGCCCTCC ACGTGCACAT GGCCGGAGGA ACTGCCATGT CGGAGGTGCA

8821 AGCACACCTG CGCATCAGAG TCCTTGGTGT GGAGGGAGGG ACCAGCGCAG CTTCCAGCCA TCCACCTGAT

8891 GAACAGAACC TAGGGAAAGC CCCAGTTCTA CTTAGACCAG GAAAGGC
Gag production from the Rev-independent gag-pol HIV-1 vector pCMVBNkan
Reverse transcriptase activity from the Rev-independent gag-pol HIV-1 vector pCMVBNkan
Transduction of 293 cells
Transduction of lymphoid cells (Jurkat)

Rev-dependent

mGag-Pol

FIG. 15B

Transduction of myeloid cells (U937)
Transduction of human primary macrophages
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BsrGI (37) 

1 CCTGGCCATTGCATACGTTGTATCCATATCATAATAT^ 

81 CATTGATTATTGACTAGTTATTAATAGTAATCA^ 
161 ATAACTTACGGTAAATGGCCCGCCTGGCTGACCCCCC^ 
241 TAGTAACGCCAATAGGGACTTTCCATTGACGT 
321 GTGTATCATATGCCAAGTACGCCCCCTATTGACGT 

SnaBI (432) 

401 CTTATGGGACTTTCCTACTTGGCAGTACATCTACGTA^ 
481 TCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCAC^ 
561 CACCAAAATCAACGGGACTTTCCAAAATGTCGTAAC^ 
641 GGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGT^ 

SacII (746)

721 GAAGACACCGGGACCGATCCAGCCTCCGCGGGCCGCGCTAA^ 

1► MetGlyCysLeuGlyAsnGlnLeuLeuIleAlaIleL

801 TGCTTTTAAGTGTCTATGGGATCTATTGTACT

13► euLeuLeuSerValTyrGlyIleTyrCysThrLeuTyrValThrValPheTyrGlyValProAlaTrpArgAsnAlaThr

881 ATTCCCCTCTTTTGTGCAACX2AAGAATAGGGATA

40► IleProLeuPheCysAlaThrLysAsnArgAspThrTrpGlyThrThrGlnCysLeuProAspAsnGlyAspTyrSerGl

961 AGTGGCCCTTAATGTTACAGAAAGCTTTGATGCCTG

66► uValAlaLeuAsnValThrGluSerPheAspAlaTrpAsnAsnThrValThrGluGlnAlaIleGluAspValTrpGlnL

1041 TCTTTGAGACCTCAATAAAGCCTTGTGTAAAATTATC

93► euPheGluThrSerIleLysProCysValLysLeuSerProLeuCysIleThrMetArgCysAsnLysSerGluThrAsp

1121 AGATGGGGATTGACAAAATCAATAACAACAACAGCAT

120► ArgTrpGlyLeuThrLysSerIleThrThrThrAlaSerThrThrSerThrThrAlaSerAlaLysValAspMetValAs

1201 TGAGACTAGTTCTTGTATAGCCCAGGATAATTG

146► nGluThrSerSerCysIleAlaGlnAspAsnCysThrGlyLeuGluGlnGluGlnMetIleSerCysLysPheAsnMetT

PstI (1329)

1281 CAGGGTTAAAAAGAGACAAGAAAAAAGAGTACAATGAAA

173► hrGlyLeuLysArgAspLysLysLysGluTyrAsnGluThrTrpTyrSerAlaAspLeuValCysGluGlnGlyAsnAsn

1361 ACTGGTAATGAAAGTAGATGTTACATGAACCACTGTAACA^

200► ThrGlyAsnGluSerArgCysTyrMetAsnHisCysAsnThrSerValIleGlnGluSerCysAspLysHisTyrTrpAs

1441 TGCTATTAGATTTAGGTATTGTGCACCTCCAGGTTATGC

226► pAlaIleArgPheArgTyrCysAlaProProGlyTyrAlaLeuLeuArgCysAsnAspThrAsnTyrSerGlyPheMetP

1521 CTAAATGTTCTAAGGTGGTGGTCTCTTCMGCACAAGGAT

253► roLysCysSerLysValValValSerSerCysThrArgMetMetGluThrGlnThrSerThrTrpPheGlyPheAsnGly
1601 ACTAGAGCAGAZ^TAGAACTTATATTTACTGGC^

280► ThrArgAlaGluAsnArgThrTyrIleTyrTrpHisGlyArgAspAsnArgThrIleIleSerLeuAsnLysTyrTyrAs

1681 TCTAACAATGAAATGTAGAAGACCAGGAAATAAGACAGTTCT

306► nLeuThrMetLysCysArgArgProGlyAsnLysThrValLeuProValThrIleMetSerGlyLeuValPheHisSerG

XcmI (1778)

1761 AACCAATCAATGATAGGCCAAAGCAGGCATGGTGTTGGTCT

333► lnProIleAsnAspArgProLysGlnAlaTrpCysTrpPheGlyGlyLysTrpLysAspAlaIleLysGluValLysGln

1841 ACCATTGTCAAACATCOCaGGTATACTGGAACT

360► ThrIleValLysHisProArgTyrThrGlyThrAsnAsnThrAspLysIleAsnLeuThrAlaProGlyGlyGlyAspPr

1921 GGAAGTTACCTTCATGTGGACAAATTGCAGAGGAGAGTTCCTCTACT

386► oGluValThrPheMetTrpThrAsnCysArgGlyGluPheLeuTyrCysLysMetAsnTrpPheLeuAsnTrpValGluA

2001 ATAGGAATACAGCTAACCAGAAGCCAAAGGAA

413► spArgAsnThrAlaAsnGlnLysProLysGluGlnHisLysArgAsnTyrValProCysHisIleArgGlnIleIleAsn

FIG. 17 
PmlI (2134)

2081 ACTTGGCATAAAGTAGGCAAAAATGTTTATTTGCCTCCAAGAGAGGGAGACCTCACGTGTAACTCCACAGTGACCAGTCT

440► ThrTrpHisLysValGlyLysAsnValTyrLeuProProArgGluGlyAspLeuThrCysAsnSerThrValThrSerLe
2161 CATAGCAAACATAGATTGGATTGATGGAAACCAAACTAATATCACCATGAGTGCAGAGGTGGCAGAACTGTATCGATTGG

466► uIleAlaAsnIleAspTrpIleAspGlyAsnGlnThrAsnIleThrMetSerAlaGluValAlaGluLeuTyrArgLeuG

2241 AATTGGGAGATTATAAATTAGTAGAGATCACTCCAATTGGCTT©

493► luLeuGlyAspTyrLysLeuValGluIleThrProIleGlyLeuAlaProThrAspValLysArgTyrThrThrGlyGly

BspMI (2378)

2321 ACCTCAAGAAATAAAAGAGGGGTCTTTGTGCTAGGGTOT

520► ThrSerArgAsnLysArgGlyValPheValLeuGlyPheLeuGlyPheLeuAlaThrAlaGlySerAlaMetGlyAlaAl

2401 CAGCCTGACCCTCACGGCACAGTCCCGAACTT

546► aSerLeuThrLeuThrAlaGlnSerArgThrLeuLeuAlaGlyIleValGlnGlnGlnGlnGlnLeuLeuAspValValL

Eam1105I (2502)
2481 AGAGACAACAAGAATTGTTGCGACTGACCGTCTGGGGAACAAAGA

573► ysArgGlnGlnGluLeuLeuArgLeuThrValTrpGlyThrLysAsnLeuGlnThrArgValThrAlaIleGluLysTyr

2561 TTAAAGGACCAGGCGCAGCTGAATGCTTGGGGATGTGCGTTT^

600► LeuLysAspGlnAlaGlnLeuAsnAlaTrpGlyCysAlaPheArgGlnValCysHisThrThrValProTrpProAsnAl

2641 AAGTCTAACACCAAAGTGGAACAATGAGACTTGGCAAGAGTGGGAGCGAAAC^

626► aSerLeuThrProLysTrpAsnAsnGluThrTrpGlnGluTrpGluArgLysValAspPheLeuGluGluAsnIleThrA

2721 CCCTCCTAGAGGAGGCACAAATTCAACAAGAGAAGAACATGTATGAATTACAAAAGTTGAATAGCTGGGATGTGTTTGGC

653► laLeuLeuGluGluAlaGlnIleGlnGlnGluLysAsnMetTyrGluLeuGlnLysLeuAsnSerTrpAspValPheGly
2801 AATTGGTTTGACCTTGCTTCTTGGAT^

680► AsnTrpPheAspLeuAlaSerTrpIleLysTyrIleGlnTyrGlyValTyrIleValValGlyValIleLeuLeuArgIl

2881 AGTGATCTATATAGTACAAATGCT^

706► eValIleTyrIleValGlnMetLeuAlaLysLeuArgGlnGlyTyrArgProValPheSerSerProProSerTyrPheG

PpuMI (2979)
2961 AGCAGACCCATATCCAACAGGACCCGGCACTGC

733► lnGlnThrHisIleGlnGlnAspProAlaLeuProThrArgGluGlyLysGluArgAspGlyGlyGluGlyGlyGlyAsn

3041 AGCTCCTGGCCTTGGCAGATAGAATATATCCACTTTCT

760► SerSerTrpProTrpGlnIleGluTyrIleHisPheLeuIleArgGlnLeuIleArgLeuLeuThrTrpLeuPheSerAs

3121 CTGTAGGACTTTGCTATCGAGAGTATACCAGATCCTCCAACCAATACTCCAGAGGCTCTCTG^

786► nCysArgThrLeuLeuSerArgValTyrGlnIleLeuGlnProIleLeuGlnArgLeuSerAlaThrLeuGlnArgIleA

3201 GAGAAGTCcS

813► rgGluValLeuArgThrGluLeuThrTyrLeuGlnTyrGlyTrpSerTyrPheHisGluAlaValGlnAlaValTrpArg

3281 TCTGCGACAGAGACTCTTGCGGGCGCGTGGGGAGACTTATGGGAGACTCTTAGGAGAGG

840► SerAlaThrGluThrLeuAlaGlyAlaTrpGlyAspLeuTrpGluThrLeuArgArgGlyGlyArgTrpIleLeuAlaIl

BamHI (3418)
EcoRI (3412)

3361 CCCCAGGAGGATTAGAQkAGGGCTTGA

866► eProArgArgIleArgGlnGlyLeuGluLeuThrLeuLeu•••

Eco47III (3457)

3441 GGGGGGGCCCGGTACG&GCGC TTAGCTAGCTAGftGRCC^^CTCCCCTGCGAGCTaRGCTG^ 

3521 ^^(^ ^r^^aA^^^^^f^^^^^rQ^^^M^^^^^^ 

BstEII (3673) 

3601 AftTGTATCACTCCAACCTAAGACAGGCGCAGCTre 
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BsaBI (3740) 

3681 GTCCGGAGCCGTGCTGCCCGGATGATGTCTT^ 
3761 CTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCC^ 
3841 TAATAAAATGAGGAAATTGCATCGCATTC^ 

SphI (3948) KpnI (3976)

3921 GGGGGAGGATTGGGAAGACAATAGCAGGCATGCTGGGGAT GCGGTGGGC^ 

BstXI (4060) 

4001 CCGGTTCCTCCTGGGCCAGAAAGAAGCAGGCACATCCCCT TCTCTGTGACACACCCTGTCC^CGCCCCTGGTTCTTAGTT 
4081 CGAGCCCC&GTCRTAGGRCACT 

4161 TCCCTCCCTCATCAGCCCACCAAACCAAACCTAGCCTCCAAGAGTGGGAAG AAAT^ 

XmnI (4293)

4241 AGAGGGAGAGAAAATGCCTCCAACATGTGAGGAAGTAAT^ 
4321 CTCGCTGCGCTCGGTCGTTCGGCT^ 

4401 GGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGGAAAAGGCCAGG 

4481 TTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGT 

4561 AAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTT 

4641 CCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCT

4721 AAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCT^ 
4801 GGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAAC^GGATO 
4881 AGTTCTTGAAGTGGTGGCCTAACTACGGC^^ 
4961 TTCGGAAAAAGAGTTGCTAGCTCTTGM 

5041 GATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCT 
5121 CACGTTAAGGGATTTTGGTC^TGAGATTATCAAAAAGGATCTTCACCT 
5201 TCAATCTAAAGTATATATGAGTAAACTT^ 

5281 TCTATTTCGTTCATCCATAGTTGCCTGACTCCGGGGGGGGGGGGCGCTGAGGTCTGCCTC 

StuI (5368)

5361 TCATACCAGGCCTGAATCGCCCCATCATCCAGCCAGAAAGTGAGGGAGCCACGGTTGATGA^ 

5441 CCAGTTGGTGATTTTGAACTTTTGCTTTGCCACGGAACGGTCTGCGTTGTC 
5521 CAGCAAAAGTTCGATTTATTCAACAAAGCCGCCGTCCCGTCAAGTCAGCOT 
5601 CCAATTCTGATTAGAAAAACTCATCGAGCATCTU^TGAAACTGCAATTT 

271<PhePheGluAspLeuMetLeuHisPheGlnLeuLysAsnMetAspProAsnAspIleGlyTyrLysG
5681 TGAAAAAGCCGTTTCICTAATGAAGGAGAA^ 

248<lnPheLeuArgLysGlnLeuSerProSerPheGluGlyLeuCysAsnTrpLeuIleAlaLeuAspGlnTyrArgAspAla
5761 GATTCCGACTCGTCCAACATCAATACAAC^ 

222<IleGlyValArgGlyValAspIleCysGlyIleLeuLysGlyGluAspPheIleLeuAsnAspLeuSerPheAspGlyHi
5841 GAGTGACGACTGAATCCGGTGAGAATGGCAAAAGC 
195<sThrValValSerAspProSerPheProLeuLeuLysHisMetGluLysTrpValGlnGluValProTrpGlyAsnArgG

PvuI (5993)
SgfI (5992)

5921 TCGTCATCAAAATCACTCGGATCAACCAAACCG 

168<luAspAspPheAspSerAlaAspValLeuGlyAsnAsnMetArgSerGlnAlaGlnAlaLeuArgPheValArgAspSer

BsrFI (6036) SspI (6067)

6001 GTTAAAAGGACAATTACAAACAGGAATCXSAATGCAACCGGCGCAG 

142<AsnPheProCysAsnCysValProIleSerHisLeuArgArgLeuPheValAlaLeuAlaAspValIleAsnGluGlySe

SmaI (6118)

6081 AATCAGGATATTCTTCTAATACCTGGAATGCTGTTTT(XCGGGGATCGCAGTGCT 

115<rAspProTyrGluGluLeuValGlnPheAlaThrLysGlyProIleAlaThrThrLeuLeuTrpAlaAspAspProThrA
6161 CGGATAAAATGCTTGATGGTCGGAAGAGGCATAAATTCCGTCAGCCAGTTTM 

88<rgIlePheHisLysIleThrProLeuProMetPheGluThrLeuTrpAsnLeuArgValMetGluAspThrValAspAsn
6241 GGCAACGCTACCTTTGCGATGTTTCAGAAACA 

62<AlaValSerGlyLysGlyHisLysLeuPheLeuGluProAlaAspProLysGlyTyrLeuArgTyrIleThrAlaGlySe

NruI (6335)

6321 ATTGCCCGACATTATCGCGAGCCCATTTATACCCATATAAATCAGCATCCA 

35<rGlnGlyValAsnAspArgAlaTrpLysTyrGlyTyrLeuAspAlaAspMetAsnSerAsnLeuArgProArgSerCysS
6401 GACGTTTCCCGTTGAATATGGCTCATAACACCCCTTCT 
8<erThrGluArgGlnIleHisSerMet

DraIII (6523)

6481 TATATTTTTATCTTGTGCAATGTAACATCAGAGATTTT^ 
6561 TTTATCAGGGTTATTGTCTCAT^ 
6641 TTTCCCCGAAAAGTGCCACCTC^^
6721 GCCCTTTCGTCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCGG^

6801 TGTAAGCC^TGCCGGGAG^^

6881 GCGGCATCAGAGCAGATTGTACTGAGAGTGCACCATAT

6961 CATCAGATTGGCTATTGG
SEQUENCE LISTING

<110> The Government of the United States of America, as 

<120> MOLECULAR CLONES WITH MUTATED HIV GAG/POL, SIV GAG AND
SIV ENV GENES

<130> 2026-4287PC2 HIV GAG/POL, SIV GAG & ENV

<140> TO BE ASSIGNED 
<141> 2002-05-31 

<150> 09/872,733 
<151> 2001-06-01 

<160> 19 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 4338 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Mutated Human
Immunodeficiency Virus-1 Gag/Pol gene

<400> 1 

atgggtgcga gagcgtcagt attaagcggg ggagaattag atcgatggga aaaaattcgg 60
ttaaggccag ggggaaagaa gtacaagcta aagcacatcg tatgggcaag cagggagcta 120
gaacgattcg cagttaatcc tggcctgtta gaaacatcag aaggctgtag acaaatactg 180
ggacagctac aaccatccct tcagacagga tcagaggagc ttcgatcact atacaacaca 240
gtagcaaccc tctattgtgt gcaccagcgg atcgagatca aggacaccaa ggaagcttta 300
gacaagatag aggaagagca aaacaagtcc aagaagaagg cccagcaggc agcagctgac 360
acaggacaca gcaatcaggt cagccaaaat taccctatag tgcagaacat ccaggggcaa 420
atggtacatc aggccatatc acctagaact ttaaatgcat gggtaaaagt agtagaagag 480
aaggctttca gcccagaagt gatacccatg ttttcagcat tatcagaagg agccacccca 540
caggacctga acacgatgtt gaacaccgtg gggggacatc aagcagccat gcaaatgtta 600
aaagagacca tcaatgagga agctgcagaa tgggatagag tgcatccagt gcatgcaggg 660
cctattgcac caggccagat gagagaacca aggggaagtg acatagcagg aactactagt 720
acccttcagg aacaaatagg atggatgaca aataatccac ctatcccagt aggagagatc 780
tacaagaggt ggataatcct gggattgaac aagatcgtga ggatgtatag ccctaccagc 840
attctggaca taagacaagg accaaaggaa ccctttagag actatgtaga ccggttctat 900
aaaactctaa gagctgagca agcttcacag gaggtaaaaa attggatgac agaaaccttg 960
ttggtccaaa atgcgaaccc agattgtaag accatcctga aggctctcgg cccagcggct 1020
acactagaag aaatgatgac agcatgtcag ggagtaggag gacccggcca taaggcaaga 1080
gttttggccg aggcgatgag ccaggtgacg aactcggcga ccataatgat gcagagaggc 1140
aacttccgga accagcggaa gatcgtcaag tgcttcaatt gtggcaaaga agggcacacc 1200
gccaggaact gccgggcccc ccggaagaag ggctgttgga aatgtggaaa ggaaggacac 1260
caaatgaaag attgtactga gagacaggct aattttttag ggaagatctg gccttcctac 1320
aagggaaggc cagggaattt tcttcagagc agaccagagc caacagcccc accagaagag 1380
agcttcaggt ctggggtaga gacaacaact ccccctcaga agcaggagcc gatagacaag 1440
gaactgtatc ctttaacttc cctcagatca ctctttggca acgacccctc gtcacagtaa 1500
ggatcggggg gcaactcaag gaagcgctgc tcgatacagg agcagatgat acagtattag 1560
aagaaatgag tttgccagga agatggaaac caaaaatgat aggggggatc gggggcttca 1620
tcaaggtgag gcagtacgac cagatactca tagaaatctg tggacataaa gctataggta 1680
cagtattagt aggacctacc tacacctgtc aacataattg gaagaaatct gttgacccag 1740
atcggctgca ccttgaactt ccccatcagc cctattgaga cggtgcccgt gaagttgaag 1800
ccggggatgg acggccccaa ggtcaagcaa tggccattga cgaaagagaa gatcaaggcc 1860
ttagtcgaaa tctgtacaga gatggagaag gaagggaaga tcagcaagat cgggcctgag 1920
aacccctaca acactccagt cttcgcaatc aagaagaagg acagtaccaa gtggagaaag 1980
ctggtggact tcagagagct gaacaagaga actcaggact tctgggaagt tcagctgggc 2040
atcccacatc ccgctgggtt gaagaagaag aagtcagtga cagtgctgga tgtgggtgat 2100
gcctacttct ccgttccctt ggacgaggac ttcaggaagt acactgcctt cacgatacct 2160
agcatcaaca acgagacacc aggcatccgc taccagtaca acgtgctgcc acagggatgg 2220
aagggatcac cagccatctt tcaaagcagc atgaccaaga tcctggagcc cttccgcaag 2280
caaaacccag acatcgtgat ctatcagtac atggacgacc tctacgtagg aagtgacctg 2340
gagatcgggg cagcacagga ccaagatcga ggagctgaga cagcatctgt tgaggtgggg 2400
actgaccaca ccagacaaga agcaccagaa ggaacctccc ttcctgtgga tgggctacga 2460
actgcatcct gacaagtgga cagtgcagcc catcgtgctg cctgagaagg acagctggac 2520
tgtgaacgac atacagaagc tcgtgggcaa gttgaactgg gcaagccaga tctacccagg 2580
catcaaagtt aggcagctgt gcaagctgct tcgaggaacc aaggcactga cagaagtgat 2640
cccactgaca gaggaagcag agctagaact ggcagagaac cgagagatcc tgaaggagcc 2700
agtacatgga gtgtactacg acccaagcaa ggacctgatc gcagagatcc agaagcaggg 2760
gcaaggccaa tggacctacc aaatctacca ggagcccttc aagaacctga agacaggcaa 2820
gtacgcaagg atgaggggtg cccacaccaa cgatgtgaag cagctgacag aggcagtgca 2880
gaagatcacc acagagagca tcgtgatctg gggcaagact cccaagttca agctgcccat 2940
acagaaggag acatgggaga catggtggac cgagtactgg caagccacct ggatccctga 3000
gtgggagttc gtgaacaccc ctcccttggt gaaactgtgg tatcagctgg agaaggaacc 3060
catcgtggga gcagagacct tctacgtgga tggggcagcc aacagggaga ccaagctggg 3120
caaggcaggc tacgtgacca accgaggacg acagaaagtg gtgaccctga ctgacaccac 3180
caaccagaag actgagctgc aagccatcta cctagctctg caagacagcg gactggaagt 3240
gaacatcgtg acagactcac agtacgcatg ggcatcatcc aagcacaacc agaccaatcc 3300
gagtcagagc tggtgaacca gatcatcgag cagctgatca agaaggagaa agtgtacctg 3360
gcatgggtac cagcacacaa aggaattgga ggaaatgaac aagtagataa attagtcagt 3420
gctgggatcc ggaaggtgct gttcctggac gggatcgata aggcccaaga tgaacatgag 3480
aagtaccact ccaactggcg cgctatggcc agcgacttca acctgccacc tgtagtagca 3540
aaagaaatag tagccagctg tgataaatgt cagctaaaag gagaagccat gcatggacaa 3600
gtagactgta gtccaggaat atggcagctg gactgcacgc acctggaggg gaaggtgatc 3660
ctggtagcag ttcatgtagc cagtggatat atagaagcag aagttatccc tgctgaaact 3720
gggcaggaaa cagcatattt tcttttaaaa ttagcaggaa gatggccagt aaaaacaata 3780
cacacggaca acggaagcaa cttcactggt gctacggtta aggccgcctg ttggtgggcg 3840
ggaatcaagc aggaatttgg aattccctac aatccccaat cgcaaggagt cgtggagagc 3900
atgaacaagg agctgaagaa gatcatcgga cagtgaggga tcaggctgag cacctgaaga 3960
cagcagtgca gatggcagtg ttcatccaca acttcaaaag aaaagggggg attggggggt 4020
acagtgcagg ggaaaggatc gtggacatca tcgccaccga catccaaacc aaggagctgc 4080
agaagcagat caccaagatc cagaacttcc gggtgtacta ccgcgacagc cgcaacccac 4140
tgtggaaggg accagcaaag ctcctctgga agggagaggg ggcagtggtg atccaggaca 4200
acagtgacat caaagtggtg ccaaggcgca aggccaagat catccgcgac tatggaaaac 4260
agatggcagg tgatgattgt gtggcaagta gacaggatga ggattagaac ctggaagagc 4320
ctggtgaagc accatatg 4338



<210> 2 
<211> 2507 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 2 

tgtacagaga tggaaaagga agggaaaatt tcaaaaattg ggcctgaaaa tccatacaat 60
actccagtat ttgccataaa gaaaaaagac agtactaaat ggagaaaatt agtagatttc 120
agagaactta ataagagaac tcaagacttc tgggaagttc aattaggaat accacatccc 180
gcagggttaa aaaagaaaaa atcagtaaca gtactggatg tgggtgatgc atatttttca 240
gttcccttag atgaagactt caggaaatat actgcattta ccatacctag tataaacaat 300
gagacaccag ggattagata ccatacctag tataaacaat gagacaccag ggatttgata 360
tcagtacaat gtgcttccac agggatggaa aggatcacca gcaatattcc aaagtagcat 420
gacaaaaatc ttagagcctt ttagaaaaca aaatccagac atagttatct atcaatacat 480
ggatgatttg tatgtaggat ctgacttaga aatagggcag catagaacaa aaatagagga 540
gctgagacaa catctgttga ggtggggact taccacacca gacaaaaaac atcagaaaga 600
acctccattc ctttggatgg gttatgaact ccatcctgat aaatggacag tacagcctat 660
agtgctgcca gaaaaagaca gctggactgt caatgacata cagaagttag tggggaaatt 720
gaattgggca agtcagattt acccagggat taaagtaagg caattatgta aactccttag 780
aggaaccaaa gcactaacag aagtaatacc actaacagaa gaagcagagc tagaactggc 840
agaaaacaga gagattctaa aagaaccagt acatggagtg tattatgacc catcaaaaga 900
cttaatagca gaaatacaga agcaggggca aggccaatgg acatatcaaa tttatcaaga 960
gccatttaaa aatctgaaaa caggaaaata tgcaagaatg aggggtgccc acactaatga 1020
tgtaaaacaa ttaacagagg cagtgcaaaa aataaccaca gaaagcatag taatatgggg 1080
aaagactcct aaatttaaac tgcccataca aaaggaaaca tgggaaacat ggtggacaga 1140
gtattggcaa gccacctgga ttcctgagtg ggagtttgtt aatacccctc ctttagtgaa 1200
attatggtac cagttagaga aagaacccat agtaggagca gaaaccttct atgtagatgg 1260
ggcagctaac agggagacta aattaggaaa agcaggatat gttactaata gaggaagaca 1320
aaaagttgtc accctaactg acacaacaaa tcagaagact gagttacaag caatttatct 1380
agctttgcag gattcgggat tagaagtaaa catagtaaca gactcacaat atgcattagg 1440
aatcattcaa gcacaaccag atcaaagtga atcagagtta gtcaatcaaa taatagagca 1500
gttaataaaa aaggaaaagg tctatctggc atgggtacca gcacacaaag gaattggagg 1560
aaatgaacaa gtagataaat tagtcagtgc tggaatcagg aaagtactat ttttagatgg 1620
aatagataag gcccaagatg aacatgagaa atatcacagt aattggagag caatggctag 1680
tgattttaac ctgccacctg tagtagcaaa agaaatagta gccagctgtg ataaatgtca 1740
gctaaaagga gaagccatgc atggacaagt agactgtagt ccaggaatat ggcaactaga 1800
ttgtacacat ttagaaggaa aagttatcct ggtagcagtt catgtagcca gtggatatat 1860
agaagcagaa gttattccag cagaaacagg gcaggaaaca gcatattttc ttttaaaatt 1920
agcaggaaga tggccagtaa aaacaataca tacagacaat ggcagcaatt tcaccagtgc 1980
tacggttaag gccgcctgtt ggtgggcggg aatcaagcag gaatttggaa ttccctacaa 2040
tccccaaagt caaggagtag tagaatctat gaataaagaa ttaaagaaaa ttataggaca 2100
ggtaagagat caggctgaac atcttaagac agcagtacaa atggcagtat tcatccacaa 2160
ttttaaaaga aaagggggga ttggggggta cagtgcaggg gaaagaatag tagacataat 2220
agcaacagac atacaaacta aagaattaca aaaacaaatt acaaaaattc aaaattttcg 2280
ggtttattac agggacagca gaaatccact ttggaaagga ccagcaaagc tcctctggaa 2340
aggtgaaggg gcagtagtaa tacaagataa tagtgacata aaagtagtgc caagaagaaa 2400
agcaaagatc attagggatt atggaaaaca gatggcaggt gatgattgtg tggcaagtag 2460
acaggatgag gattagaaca tggaaaagtt tagtaaaaca ccatatg 2507



<210> 3 
<211> 2467 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Mutated Human
Immunodeficiency Virus-1 Pol gene

<400> 3 

tgtacagaga tggagaagga agggaagatc agcaagatcg ggcctgagaa cccctacaac 60
actccagtct tcgcaatcaa gaagaaggac agtaccaagt ggagaaagct ggtggacttc 120
agagagctga acaagagaac tcaggacttc tgggaagttc agctgggcat cccacatccc 180
gctgggttga agaagaagaa gtcagtgaca gtgctggatg tgggtgatgc ctacttctcc 240
gttcccttgg acgaggactt caggaagtac actgccttca cgatacctag catcaacaac 300
gagacaccag gcatccgcta ccagtacaac gtgctgccac agggatggaa gggatcacca 360
gccatctttc aaagcagcat gaccaagatc ctggagccct tccgcaagca aaacccagac 420
atcgtgatct atcagtacat ggacgacctc tacgtaggaa gtgacctgga gatcgggcag 480
cacaggacca agatcgagga gctgagacag catctgttga ggtggggact gaccacacca 540
gacaagaagc accagaagga acctcccttc ctgtggatgg gctacgaact gcatcctgac 600
aagtggacag tgcagcccat cgtgctgcct gagaaggaca gctggactgt gaacgacata 660
cagaagctcg tgggcaagtt gaactgggca agccagatct acccaggcat caaagttagg 720
cagctgtgca agctgcttcg aggaaccaag gcactgacag aagtgatccc actgacagag 780
gaagcagagc tagaactggc agagaaccga gagatcctga aggagccagt acatggagtg 840
tactacgacc caagcaagga cctgatcgca gagatccaga agcaggggca aggccaatgg 900
acctaccaaa tctaccagga gcccttcaag aacctgaaga caggcaagta cgcaaggatg 960
aggggtgccc acaccaacga tgtgaagcag ctgacagagg cagtgcagaa gatcaccaca 1020
gagagcatcg tgatctgggg caagactccc aagttcaagc tgcccataca gaaggagaca 1080
tgggagacat ggtggaccga gtactggcaa gccacctgga tccctgagtg ggagttcgtg 1140
aacacccctc ccttggtgaa actgtggtat cagctggaga aggaacccat cgtgggagca 1200
gagaccttct acgtggatgg ggcagccaac agggagacca agctgggcaa ggcaggctac 1260
gtgaccaacc gaggacgaca gaaagtggtg accctgactg acaccaccaa ccagaagact 1320
gagctgcaag ccatctacct agctctgcaa gacagcggac tggaagtgaa catcgtgaca 1380
gactcacagt acgcactggg catcatccaa gcacaaccag accaatccga gtcagagctg 1440
gtgaaccaga tcatcgagca gctgatcaag aaggagaaag tgtacctggc atgggtacca 1500
gcacacaaag gaattggagg aaatgaacaa gtagataaat tagtcagtgc tgggatccgg 1560
aaggtgctgt tcctggacgg gatcgataag gcccaagatg aacatgagaa gtaccactcc 1620
aactggcgcg ctatggccag cgacttcaac ctgccacctg tagtagcaaa agaaatagta 1680
gccagctgtg ataaatgtca gctaaaagga gaagccatgc atggacaagt agactgtagt 1740
ccaggaatat ggcagctgga ctgcacgcac ctggagggga aggtgatcct ggtagcagtt 1800
catgtagcca gtggatatat agaagcagaa gttatccctg ctgaaactgg gcaggaaaca 1860
gcatattttc ttttaaaatt agcaggaaga tggccagtaa aaacaataca cacggacaac 1920
ggaagcaact tcactggtgc tacggttaag gccgcctgtt ggtgggcggg aatcaagcag 1980
gaatttggaa ttccctacaa tccccaatcg caaggagtcg tggagagcat gaacaaggag 2040
ctgaagaaga tcatcggaca agtgagggat caggctgagc acctgaagac agcagtgcag 2100
atggcagtgt tcatccacaa cttcaaaaga aaagggggga ttggggggta cagtgcaggg 2160
gaaaggatcg tggacatcat cgccaccgac atccaaacca aggagctgca gaagcagatc 2220
accaagatcc agaacttccg ggtgtactac cgcgacagcc gcaacccact gtggaaggga 2280
ccagcaaagc tcctctggaa gggagagggg gcagtggtga tccaggacaa cagtgacatc 2340
aaagtggtgc caaggcgcaa ggccaagatc atccgcgact atggaaaaca gatggcaggt 2400
gatgattgtg tggcaagtag acaggatgag gattagaacc tggaagagcc tggtgaagca 2460
ccatatg 2467



<210> 4 
<211> 1533 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Mutated 
Simian Immunodeficiency Virus Gag gene 

<400> 4 

atgggcgtga gaaactccgt cttgtcaggg aagaaagcag atgaattaga aaaaattagg 60
ctacgaccca acggaaagaa aaagtacatg ttgaagcatg tagtatgggc agcaaatgaa 120
ttagatagat ttggattagc agaaagcctg ttggagaaca aagaaggatg tcaaaaaata 180
ctttcggtct tagctccatt agtgccaaca ggctcagaaa atttaaaaag cctttataat 240
actgtctgcg tcatctggtg cattcacgca gaagagaaag tgaaacacac tgaggaagca 300
aaacagatag tgcagagaca cctagtggtg gaaacaggaa ccaccgaaac catgccgaag 360
acctctcgac caacagcacc atctagcggc agaggaggaa actacccagt acagcagatc 420
ggtggtaact acgtccacct gccactgtcc ccgagaaccc tgaacgcttg ggtcaagctg 480
atcgaggaga agaagttcgg agcagaagta gtgccaggat tccaggcact gtcagaaggt 540
tgcaccccct acgacatcaa ccagatgctg aactgcgttg gagaccatca ggcggctatg 600
cagatcatcc gtgacatcat caacgaggag gctgcagatt gggacttgca gcacccacaa 660
ccagctccac aacaaggaca acttagggag ccgtcaggat cagacatcgc aggaaccacc 720
tcctcagttg acgaacagat ccagtggatg taccgtcagc agaacccgat cccagtaggc 780
aacatctacc gtcgatggat ccagctgggt ctgcagaagt gcgtccgtat gtacaacccg 840
accaacattc tagatgtaaa acaagggcca aaagagccat ttcagagcta tgtagacagg 900
ttctacaaaa gtttaagagc agaacagaca gatgcagcag taaagaattg gatgactcaa 960
acactgctga ttcaaaatgc taacccagat tgcaagctag tgctgaaggg gctgggtgtg 1020
aatcccaccc tagaagaaat gctgacggct tgtcaaggag taggggggcc gggacagaag 1080
gctagattaa tggcagaagc cctgaaagag gccctcgcac cagtgccaat cccttttgca 1140
gcagcccaac agaggggacc aagaaagcca attaagtgtt ggaattgtgg gaaagaggga 1200
cactctgcaa ggcaatgcag agccccaaga agacagggat gctggaaatg tggaaaaatg 1260
gaccatgtta tggccaaatg cccagacaga caggcgggtt ttttaggcct tggtccatgg 1320
ggaaagaagc cccgcaattt ccccatggct caagtgcatc aggggctgat gccaactgct 1380
cccccagagg acccagctgt ggatctgcta aagaactaca tgcagttggg caagcagcag 1440
agagaaaagc agagagaaag cagagagaag ccttacaagg aggtgacaga ggatttgctg 1500
cacctcaatt ctctctttgg aggagaccag tag 1533

<210> 5 
<211> 1532 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 

sequence of mutated Simian Immunodeficiency Virus 
Gag gene (SIVgagDX) with wild-type SIV 239 Gag 
gene 

<400> 5 

atgggcgtga gaaactccgt cttgtcaggg aagaaagcag atgaattaga aaaaattagg 60
ctacgaccca acggaaagaa aaagtacatg ttgaagcatg tagtatgggc agcaaatgaa 120
ttagatagat ttggattagc agaaagcctg ttggagaaca aagaaggatg tcaaaaaata 180
ctttcggtct tagctccatt agtgccaaca ggctcagaaa atttaaaaag cctttataat 240
actgtctgcg tcatctggtg cattcacgca gaagagaaag tgaaacacac tgaggaagca 300
aaacagatag tgcagagaca cctagtggtg gaaacaggaa cmacmgaaac yatgccraar 360
acmwstmgac caacagcacc atctagcggc agaggaggaa aytacccagt acarcaratm 420
ggtggtaact aygtccacct gccaytrwsc ccgagaacmy traaygcytg ggtmaarytg 480
atmgaggara agaarttygg agcagaagta gtgccaggat tycaggcact gtcagaaggt 540
tgcaccccct aygacatyaa ycagatgytr aaytgygtkg gagaccatca rgcggctatg 600
cagatyatcm gwgayatyat maacgaggag gctgcagatg ggacttgcag cacccacaac 660
cagctccaca acaaggacaa cttagggagc cgtcaggatc agayatygca ggaacmacyw 720
sytcagtwga ygaacaratc cagtggatgt acmgwcarca gaacccsatm ccagtaggca 780
acatytacmg kmgatggatc carctgggky tgcaraartg ygtymgwatg tayaacccra 840
cmaacattct agatgtaaaa caagggccaa aagagccatt tcagagctat gtagacaggt 900
tctacaaaag tttaagagca gaacagacag atgcagcagt aaagaattgg atgactcaaa 960
cactgctgat tcaaaatgct aacccagatt gcaagctagt gctgaagggg ctgggtgtga 1020
atcccaccct agaagaaatg ctgacggctt gtcaaggagt aggggggccg ggacagaagg 1080
ctagattaat ggcagaagcc ctgaaagagg ccctcgcacc agtgccaatc ccttttgcag 1140
cagcccaaca gaggggacca agaaagccaa ttaagtgttg gaattgtggg aaagagggac 1200
actctgcaag gcaatgcaga gccccaagaa gacagggatg ctggaaatgt ggaaaaatgg 1260
accatgttat ggccaaatgc ccagacagac aggcgggttt tttaggcctt ggtccatggg 1320
gaaagaagcc ccgcaatttc cccatggctc aagtgcatca ggggctgatg ccaactgctc 1380
ccccagagga cccagctgtg gatctgctaa agaactacat gcagttgggc aagcagcaga 1440
gagaaaagca gagagaaagc agagagaagc cttacaagga ggtgacagag gatttgctgc 1500
acctcaattc tctctttgga ggagaccagt ag 1532

<210> 6
<211> 8366
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
of the construct pCMVgagpolBNKan containing a CMV 
promoter, a HIV gag/pol gene and a kanamycin 
resistance gene 

<400> 6 

cctggccatt gcatacgttg tatccatatc ataatatgta catttatatt ggctcatgtc 60
caacattacc gccatgttga cattgattat tgactagtta ttaatagtaa tcaattacgg 120
ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg gtaaatggcc 180
cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg tatgttccca 240
tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta cggtaaactg 300
cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt gacgtcaatg 360
acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac tttcctactt 420
ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt tggcagtaca 480
tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac cccattgacg 540
tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt cgtaacaact 600
ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat ataagcagag 660
ctcgtttagt gaaccgtcag atcgcctgga gacgccatcc acgctgtttt gacctccata 720
gaagacaccg ggaccgatcc agcctccgcg ggcgcgcgtc gacagagaga tgggtgcgag 780
agcgtcagta ttaagcgggg gagaattaga tcgatgggaa aaaattcggt taaggccagg 840
gggaaagaag aagtacaagc taaagcacat cgtatgggca agcagggagc tagaacgatt 900
cgcagttaat cctggcctgt tagaaacatc agaaggctgt agacaaatac tgggacagct 960
acaaccatcc cttcagacag gatcagagga gcttcgatca ctatacaaca cagtagcaac 1020
cctctattgt gtgcaccagc ggatcgagat caaggacacc aaggaagctt tagacaagat 1080
agaggaagag caaaacaagt ccaagaagaa ggcccagcag gcagcagctg acacaggaca 1140
cagcaatcag gtcagccaaa attaccctat agtgcagaac atccaggggc aaatggtaca 1200
tcaggccata tcacctagaa ctttaaatgc atgggtaaaa gtagtagaag agaaggcttt 1260
cagcccagaa gtgataccca tgttttcagc attatcagaa ggagccaccc cacaggacct 1320
gaacacgatg ttgaacaccg tggggggaca tcaagcagcc atgcaaatgt taaaagagac 1380
catcaatgag gaagctgcag aatgggatag agtgcatcca gtgcatgcag ggcctattgc 1440
accaggccag atgagagaac caaggggaag tgacatagca ggaactacta gtacccttca 1500
ggaacaaata ggatggatga caaataatcc acctatccca gtaggagaga tctacaagag 1560
gtggataatc ctgggattga acaagatcgt gaggatgtat agccctacca gcattctgga 1620
cataagacaa ggaccaaagg aaccctttag agactatgta gaccggttct ataaaactct 1680
aagagctgag caagcttcac aggaggtaaa aaattggatg acagaaacct tgttggtcca 1740
aaatgcgaac ccagattgta agaccatcct gaaggctctc ggcccagcgg ctacactaga 1800
agaaatgatg acagcatgtc agggagtagg aggacccggc cataaggcaa gagttttggc 1860
cgaggcgatg agccaggtga cgaactcggc gaccataatg atgcagagag gcaacttccg 1920
gaaccagcgg aagatcgtca agtgcttcaa ttgtggcaaa gaagggcaca ccgccaggaa 1980
ctgccgggcc ccccggaaga agggctgttg gaaatgtgga aaggaaggac accaaatgaa 2040
agattgtact gagagacagg ctaatttttt agggaagatc tggccttcct acaagggaag 2100
gccagggaat tttcttcaga gcagaccaga gccaacagcc ccaccagaag agagcttcag 2160
gtctggggta gagacaacaa ctccccctca gaagcaggag ccgatagaca aggaactgta 2220
tcctttaact tccctcagat cactctttgg caacgacccc tcgtcacagt aaggatcggg 2280
gggcaactca aggaagcgct gctcgataca ggagcagatg atacagtatt agaagaaatg 2340
agtttgccag gaagatggaa accaaaaatg atagggggga tcgggggctt catcaaggtg 2400
aggcagtacg accagatact catagaaatc tgtggacata aagctatagg tacagtatta 2460
gtaggaccta cacctgtcaa cataattgga agaaatctgt tgacccagat cggctgcacc 2520
ttgaacttcc ccatcagccc tattgagacg gtgcccgtga agttgaagcc ggggatggac 2580
ggccccaagg tcaagcaatg gccattgacg aaagagaaga tcaaggcctt agtcgaaatc 2640
tgtacagaga tggagaagga agggaagatc agcaagatcg ggcctgagaa cccctacaac 2700
actccagtct tcgcaatcaa gaagaaggac agtaccaagt ggagaaagct ggtggacttc 2760
agagagctga acaagagaac tcaggacttc tgggaagttc agctgggcat cccacatccc 2820
gctgggttga agaagaagaa gtcagtgaca gtgctggatg tgggtgatgc ctacttctcc 2880
gttcccttgg acgaggactt caggaagtac actgccttca cgatacctag catcaacaac 2940
gagacaccag gcatccgcta ccagtacaac gtgctgccac agggatggaa gggatcacca 3000
gccatctttc aaagcagcat gaccaagatc ctggagccct tccgcaagca aaacccagac 3060
atcgtgatct atcagtacat ggacgacctc tacgtaggaa gtgacctgga gatcgggcag 3120
cacaggacca agatcgagga gctgagacag catctgttga ggtggggact gaccacacca 3180
gacaagaagc accagaagga acctcccttc ctgtggatgg gctacgaact gcatcctgac 3240
aagtggacag tgcagcccat cgtgctgcct gagaaggaca gctggactgt gaacgacata 3300
cagaagctcg tgggcaagtt gaactgggca agccagatct acccaggcat caaagttagg 3360
cagctgtgca agctgcttcg aggaaccaag gcactgacag aagtgatccc actgacagag 3420
gaagcagagc tagaactggc agagaaccga gagatcctga aggagccagt acatggagtg 3480
tactacgacc caagcaagga cctgatcgca gagatccaga agcaggggca aggccaatgg 3540
acctaccaaa tctaccagga gcccttcaag aacctgaaga caggcaagta cgcaaggatg 3600
aggggtgccc acaccaacga tgtgaagcag ctgacagagg cagtgcagaa gatcaccaca 3660
gagagcatcg tgatctgggg caagactccc aagttcaagc tgcccataca gaaggagaca 3720
tgggagacat ggtggaccga gtactggcaa gccacctgga tccctgagtg ggagttcgtg 3780
aacacccctc ccttggtgaa actgtggtat cagctggaga aggaacccat cgtgggagca 3840
gagaccttct acgtggatgg ggcagccaac agggagacca agctgggcaa ggcaggctac 3900
gtgaccaacc gaggacgaca gaaagtggtg accctgactg acaccaccaa ccagaagact 3960
gagctgcaag ccatctacct agctctgcaa gacagcggac tggaagtgaa catcgtgaca 4020
gactcacagt acgcactggg catcatccaa gcacaaccag accaatccga gtcagagctg 4080
gtgaaccaga tcatcgagca gctgatcaag aaggagaaag tgtacctggc atgggtacca 4140
gcacacaaag gaattggagg aaatgaacaa gtagataaat tagtcagtgc tgggatccgg 4200
aaggtgctgt tcctggacgg gatcgataag gcccaagatg aacatgagaa gtaccactcc 4260
aactggcgcg ctatggccag cgacttcaac ctgccacctg tagtagcaaa agaaatagta 4320
gccagctgtg ataaatgtca gctaaaagga gaagccatgc atggacaagt agactgtagt 4380
ccaggaatat ggcagctgga ctgcacgcac ctggagggga aggtgatcct ggtagcagtt 4440
catgtagcca gtggatatat agaagcagaa gttatccctg ctgaaactgg gcaggaaaca 4500
gcatattttc ttttaaaatt agcaggaaga tggccagtaa aaacaataca cacggacaac 4560
ggaagcaact tcactggtgc tacggttaag gccgcctgtt ggtgggcggg aatcaagcag 4620
gaatttggaa ttccctacaa tccccaatcg caaggagtcg tggagagcat gaacaaggag 4680
ctgaagaaga tcatcggaca agtgagggat caggctgagc acctgaagac agcagtgcag 4740
atggcagtgt tcatccacaa cttcaaaaga aaagggggga ttggggggta cagtgcaggg 4800
gaaaggatcg tggacatcat cgccaccgac atccaaacca aggagctgca gaagcagatc 4860
accaagatcc agaacttccg ggtgtactac cgcgacagcc gcaacccact gtggaaggga 4920
ccagcaaagc tcctctggaa gggagagggg gcagtggtga tccaggacaa cagtgacatc 4980
aaagtggtgc caaggcgcaa ggccaagatc atccgcgact atggaaaaca gatggcaggt 5040
gatgattgtg tggcaagtag acaggatgag gattagaacc tggaagagcc tggtgaagca 5100
ccatatggcg ttcgaagcta gcctcgagat ccagatctgc tgtgccttct agttgccagc 5160
catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg 5220
tcctttccta ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc 5280
tggggggtgg ggtggggcag cacagcaagg gggaggattg ggaagacaat agcaggcatg 5340
ctggggatgc ggtgggctct atgggtaccc aggtgctgaa gaattgaccc ggttcctcct 5400
gggccagaaa gaagcaggca catccccttc tctgtgacac accctgtcca cgcccctggt 5460
tcttagttcc agccccactc ataggacact catagctcag gagggctccg ccttcaatcc 5520
cacccgctaa agtacttgga gcggtctctc cctccctcat cagcccacca aaccaaacct 5580
agcctccaag agtgggaaga aattaaagca agataggcta ttaagtgcag agggagagaa 5640
aatgcctcca acatgtgagg aagtaatgag agaaatcata gaatttcttc cgcttcctcg 5700
ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 5760
gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 5820
ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 5880
cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 5940
ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 6000
accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 6060
caatgctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 6120
gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 6180
tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 6240
agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 6300
actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 6360
gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 6420
aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 6480
gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca 6540
aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt 6600
atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca 6660
gcgatctgtc tatttcgttc atccatagtt gcctgactcc gggggggggg ggcgctgagg 6720
tctgcctcgt gaagaaggtg ttgctgactc ataccaggcc tgaatcgccc catcatccag 6780
ccagaaagtg agggagccac ggttgatgag agctttgttg taggtggacc agttggtgat 6840
tttgaacttt tgctttgcca cggaacggtc tgcgttgtcg ggaagatgcg tgatctgatc 6900
cttcaactca gcaaaagttc gatttattca acaaagccgc cgtcccgtca agtcagcgta 6960
atgctctgcc agtgttacaa ccaattaacc aattctgatt agaaaaactc atcgagcatc 7020
aaatgaaact gcaatttatt catatcagga ttatcaatac catatttttg aaaaagccgt 7080
ttctgtaatg aaggagaaaa ctcaccgagg cagttccata ggatggcaag atcctggtat 7140
cggtctgcga ttccgactcg tccaacatca atacaaccta ttaatttccc ctcgtcaaaa 7200
ataaggttat caagtgagaa atcaccatga gtgacgactg aatccggtga gaatggcaaa 7260
agcttatgca tttctttcca gacttgttca acaggccagc cattacgctc gtcatcaaaa 7320
tcactcgcat caaccaaacc gttattcatt cgtgattgcg cctgagcgag acgaaatacg 7380
cgatcgctgt taaaaggaca attacaaaca ggaatcgaat gcaaccggcg caggaacact 7440
gccagcgcat caacaatatt ttcacctgaa tcaggatatt cttctaatac ctggaatgct 7500
gttttcccgg ggatcgcagt ggtgagtaac catgcatcat caggagtacg gataaaatgc 7560
ttgatggtcg gaagaggcat aaattccgtc agccagttta gtctgaccat ctcatctgta 7620
acatcattgg caacgctacc tttgccatgt ttcagaaaca actctggcgc atcgggcttc 7680
ccatacaatc gatagattgt cgcacctgat tgcccgacat tatcgcgagc ccatttatac 7740
ccatataaat cagcatccat gttggaattt aatcgcggcc tcgagcaaga cgtttcccgt 7800
tgaatatggc tcataacacc ccttgtatta ctgtttatgt aagcagacag ttttattgtt 7860
catgatgata tatttttatc ttgtgcaatg taacatcaga gattttgaga cacaacgtgg 7920
ctttcccccc ccccccatta ttgaagcatt tatcagggtt attgtctcat gagcggatac 7980
atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa 8040
gtgccacctg acgtctaaga aaccattatt atcatgacat taacctataa aaataggcgt 8100
atcacgaggc cctttcgtct cgcgcgtttc ggtgatgacg gtgaaaacct ctgacacatg 8160
cagctcccgg agacggtcac agcttgtctg taagcggatg ccgggagcag acaagcccgt 8220
cagggcgcgt cagcgggtgt tggcgggtgt cggggctggc ttaactatgc ggcatcagag 8280
cagattgtac tgagagtgca ccatatgcgg tgtgaaatac cgcacagatg cgtaaggaga 8340
aaataccgca tcagattggc tattgg 8366



<210> 7 
<211> 271 
<212> PRT 

<213> Escherichia coli 
<400> 7 

Met Ser His Ile Gln Arg Glu Thr Ser Cys Ser Arg Pro Arg Leu Asn
  1               5                  10                  15

Ser Asn Met Asp Ala Asp Leu Tyr Gly Tyr Lys Trp Ala Arg Asp Asn
             20                  25                  30

Val Gly Gln Ser Gly Ala Thr Ile Tyr Arg Leu Tyr Gly Lys Pro Asp
         35                  40                  45

Ala Pro Glu Leu Phe Leu Lys His Gly Lys Gly Ser Val Ala Asn Asp
     50                  55                  60

Val Thr Asp Glu Met Val Arg Leu Asn Trp Leu Thr Glu Phe Met Pro
 65                  70                  75                  80

Leu Pro Thr Ile Lys His Phe Ile Arg Thr Pro Asp Asp Ala Trp Leu
                 85                  90                  95

Leu Thr Thr Ala Ile Pro Gly Lys Thr Ala Phe Gln Val Leu Glu Glu
            100                 105                 110

Tyr Pro Asp Ser Gly Glu Asn Ile Val Asp Ala Leu Ala Val Phe Leu
        115                 120                 125

Arg Arg Leu His Ser Ile Pro Val Cys Asn Cys Pro Phe Asn Ser Asp
    130                 135                 140

Arg Val Phe Arg Leu Ala Gln Ala Gln Ser Arg Met Asn Asn Gly Leu
145                 150                 155                 160

Val Asp Ala Ser Asp Phe Asp Asp Glu Arg Asn Gly Trp Pro Val Glu
                165                 170                 175

Gln Val Trp Lys Glu Met His Lys Leu Leu Pro Phe Ser Pro Asp Ser
            180                 185                 190

Val Val Thr His Gly Asp Phe Ser Leu Asp Asn Leu Ile Phe Asp Glu
        195                 200                 205

Gly Lys Leu Ile Gly Cys Ile Asp Val Gly Arg Val Gly Ile Ala Asp
    210                 215                 220

Arg Tyr Gln Asp Leu Ala Ile Leu Trp Asn Cys Leu Gly Glu Phe Ser
225                 230                 235                 240

Pro Ser Leu Gln Lys Arg Leu Phe Gln Lys Tyr Gly Ile Asp Asn Pro
                245                 250                 255

Asp Met Asn Lys Leu Gln Phe His Leu Met Leu Asp Glu Phe Phe
            260                 265                 270



<210> 8 
<211> 8937 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
of transfer construct pmBCwCNluci

<400> 8 

tggaagggct aatttggtcc caaaaaagac aagagatcct tgatctgtgg atctaccaca 60
cacaaggcta cttccctgat tggcagaact acacaccagg gccagggatc agatatccac 120
tgacctttgg atggtgcttc aagttagtac cagttgaacc agagcaagta gaagaggcca 180
aataaggaga gaagaacagc ttgttacacc ctatgagcca gcatgggatg gaggacccgg 240
agggagaagt attagtgtgg aagtttgaca gcctcctagc atttcgtcac atggcccgag 300
agctgcatcc ggagtactac aaagactgct gacatcgagc tttctacaag ggactttccg 360
ctggggactt tccagggagg tgtggcctgg gcgggactgg ggagtggcga gccctcagat 420
gctacatata agcagctgct ttttgcctgt actgggtctc tctggttaga ccagatctga 480
gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct 540
tgagtgctca aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc 600
agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacttgaaag 660
cgaaagtaaa gccagaggag atctctcgac gcaggactcg gcttgctgaa gcgcgcacgg 720
caagaggcga ggggcggcgc ctgacgagga cgccaaaaat tttgactagc ggaggctaga 780
a 9"g a <J a £J a £J c teggtgegag agegtcagta ttaagcgggg gagaattaga tcgatgggaa 84 0 
aaaattcggt taaggecagg gggaaagaaa aaatataaat taaaacatat agtatgggca 900 
agcagggagc tagaacgatt cgcagttaat cctggcctgt tagaaacatc agaaggctgt 960 
agacaaatac tgggacagct acaaccatcc cttcagacag gatcagaaga acttagatca 102 0 
ttatataata cagtagcaac cctctattgt gtgcatcaaa ggatagagat aaaagacacc 1080 
aaggaagctt tagacaagat agaggaagag caaaacaaaa gtaagaaaaa agcacagcaa 1140 
gcagcagctg acacaggaca cagcaatcag gtcagccaaa attaccctat agtgcagaac 12 00 
atccaggggc aaatggtaca tcaggccata tcacctagaa ctttaaacga taagcttggg 1260
agttccgcgt tacataactt acggtaaatg gcccgcctgg ctgaccgccc aacgaccccc 1320
gcccattgac gtcaataatg acgtatgttc ccatagtaac gccaataggg actttccatt 1380
gacgtcaatg ggtggagtat ttacggtaaa ctgcccactt ggcagtacat caagtgtatc 1440
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 1500
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 1560
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 1620
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 1680
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 1740
ggcgtgtacg gtgggaggtc tatataagca gagctcgttt agtgaaccgt cagatcgcct 1800
ggagacgcca tccacgctgt tttgacctcc atagaagaca ccgactctag aggatccatc 1860
taagtaagct tggcattccg gtactgttgg taaaatggaa gacgccaaaa acataaagaa 1920
aggcccggcg ccattctatc ctctagagga tggaaccgct ggagagcaac tgcataaggc 1980
tatgaagaga tacgccctgg ttcctggaac aattgctttt acagatgcac atatcgaggt 2040
gaacatcacg tacgcggaat acttcgaaat gtccgttcgg ttggcagaag ctatgaaacg 2100
atatgggctg aatacaaatc acagaatcgt cgtatgcagt gaaaactctc ttcaattctt 2160
tatgccggtg ttgggcgcgt tatttatcgg agttgcagtt gcgcccgcga acgacattta 2220
taatgaacgt gaattgctca acagtatgaa catttcgcag cctaccgtag tgtttgtttc 2280
caaaaagggg ttgcaaaaaa ttttgaacgt gcaaaaaaaa ttaccaataa tccagaaaat 2340
tattatcatg gattctaaaa cggattacca gggatttcag tcgatgtaca cgttcgtcac 2400
atctcatcta cctcccggtt ttaatgaata cgattttgta ccagagtcct ttgatcgtga 2460
caaaacaatt gcactgataa tgaattcctc tggatctact gggttaccta agggtgtggc 2520
ccttccgcat agaactgcct gcgtcagatt ctcgcatgcc agagatccta tttttggcaa 2580
tcaaatcatt ccggatactg cgattttaag tgttgttcca ttccatcacg gttttggaat 2640
gtttactaca ctcggatatt tgatatgtgg atttcgagtc gtcttaatgt atagatttga 2700
agaagagctg tttttacgat cccttcagga ttacaaaatt caaagtgcgt tgctagtacc 2760
aaccctattt tcattcttcg ccaaaagcac tctgattgac aaatacgatt tatctaattt 2820
acacgaaatt gcttctgggg gcgcacctct ttcgaaagaa gtcggggaag cggttgcaaa 2880
acgcttccat cttccaggga tacgacaagg atatgggctc actgagacta catcagctat 2940
tctgattaca cccgaggggg atgataaacc gggcgcggtc ggtaaagttg ttccattttt 3000
tgaagcgaag gttgtggatc tggataccgg gaaaacgctg ggcgttaatc agagaggcga 3060
attatgtgtc agaggaccta tgattatgtc cggttatgta aacaatccgg aagcgaccaa 3120
cgccttgatt gacaaggatg gatggctaca ttctggagac atagcttact gggacgaaga 3180
cgaacacttc ttcatagttg accgcttgaa gtctttaatt aaatacaaag gatatcaggt 3240
ggcccccgct gaattggaat cgatattgtt acaacacccc aacatcttcg acgcgggcgt 3300
ggcaggtctt cccgacgatg acgccggtga acttcccgcc gccgttgttg ttttggagca 3360
cggaaagacg atgacggaaa aagagatcgt ggattacgtc gccagtcaag taacaaccgc 3420
gaaaaagttg cgcggaggag ttgtgtttgt ggacgaagta ccgaaaggtc ttaccggaaa 3480
actcgacgca agaaaaatca gagagatcct cataaaggcc aagaagggcg gaaagtccaa 3540
attgtaactc gagggggggc ccggtacctt taagaccaat gacttacaag gcagctgtag 3600
atcttagcca ctttttaaaa gaaaaggggg gactggaagg gctaattcac tcccaaagaa 3660
gacaagatat ccttgatctg tggatctacc acacacaagg ctacttccct gattggcaga 3720
actacacacc agggccaggg gtcagatatc cactgacctt tggatggtgc tacaagctag 3780
taccagttga gccagataag gtagaagagg ccaataaagg agagaacacc agcttgttac 3840
accctgtgag cctgcatgga atggatgacc ctgagagaga agtgttagag tggaggtttg 3900
acagccgcct agcatttcat cacgtggccc gagagctgca tccggagtac ttcaagaact 3960
gctgacatcg agcttgctac aagggacttt ccgctgggga ctttccaggg aggcgtggcc 4020
tgggcgggac tggggagtgg cgagccctca gatgctgcat ataagcagct gctttttgcc 4080
tgtactgggt ctctctggtt agaccagatc tgagcctggg agctctctgg ctaactaggg 4140
aacccactgc ttaagcctca ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt 4200
ctgttgtgtg actctggtaa ctagagatcc ctcagaccct tttagtcagt gtggaaaatc 4260
tctagcaccc cccaggaggt agaggttgca gtgagccaag atcgcgccac tgcattccag 4320
cctgggcaag aaaacaagac tgtctaaaat aataataata agttaagggt attaaatata 4380
tttatacatg gaggtcataa aaatatatat atttgggctg ggcgcagtgg ctcacacctg 4440
cgcccggccc tttgggaggc cgaggcaggt ggatcacctg agtttgggag ttccagacca 4500
gcctgaccaa catggagaaa ccccttctct gtgtattttt agtagatttt attttatgtg 4560
tattttattc acaggtattt ctggaaaact gaaactgttt ttcctctact ctgataccac 4620
aagaatcatc agcacagagg aagacttctg tgatcaaatg tggtgggaga gggaggtttt 4680
caccagcaca tgagcagtca gttctgccgc agactcggcg ggtgtccttc ggttcagttc 4740
caacaccgcc tgcctggaga gaggtcagac cacagggtga gggctcagtc cccaagacat 4800
aaacacccaa gacataaaca cccaacaggt ccaccccgcc tgctgcccag gcagagccga 4860
ttcaccaaga cgggaattag gatagagaaa gagtaagtca cacagagccg gctgtgcggg 4920
agaacggagt tctattatga ctcaaatcag tctccccaag cattcgggga tcagagtttt 4980
taaggataac ttagtgtgta gggggccagt gagttggaga tgaaagcgta gggagtcgaa 5040
ggtgtccttt tgcgccgagt cagttcctgg gtgggggcca caagatcgga tgagccagtt 5100
tatcaatccg ggggtgccag ctgatccatg gagtgcaggg tctgcaaaat atctcaagca 5160
ctgattgatc ttaggtttta caatagtgat gttaccccag gaacaatttg gggaaggtca 5220
gaatcttgta gcctgtagct gcatgactcc taaaccataa tttctttttt gttttttttt 5280
ttttattttt gagacagggt ctcactctgt cacctaggct ggagtgcagt ggtgcaatca 5340
cagctcactg cagcccctag agcggccgcc accgcggtgg agctccaatt cgccctatag 5400
tgagtcgtat tacaattcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg 5460
cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga 5520
agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgcga 5580
aattgtaaac gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa tcagctcatt 5640
ttttaaccaa taggccgaaa tcggcaaaat cccttataaa tcaaaagaat agaccgagat 5700
agggttgagt gttgttccag tttggaacaa gagtccacta ttaaagaacg tggactccaa 5760
cgtcaaaggg cgaaaaaccg tctatcaggg cgatggccca ctacgtgaac catcacccta 5820
atcaagtttt ttggggtcga ggtgccgtaa agcactaaat cggaacccta aagggagccc 5880
ccgatttaga gcttgacggg gaaagccggc gaacgtggcg agaaaggaag ggaagaaagc 5940
gaaaggagcg ggcgctaggg cgctggcaag tgtagcggtc acgctgcgcg taaccaccac 6000
acccgccgcg cttaatgcgc cgctacaggg cgcgtcccag gtggcacttt tcggggaaat 6060
gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg 6120
agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa 6180
catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac 6240
ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac 6300
atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt 6360
ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tattgacgcc 6420
gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca 6480
ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc 6540
ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag 6600
gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa 6660
ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg 6720
gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa 6780
ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg 6840
gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt 6900
gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt 6960
caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag 7020
cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat 7080
ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct 7140
taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct 7200
tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca 7260
gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc 7320
agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc 7380
aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct 7440
gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag 7500
gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc 7560
tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg 7620
agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag 7680
cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt 7740
gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac 7800
gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg 7860
ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc 7920
cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcccaata 7980
cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca cgacaggttt 8040
cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct cactcattag 8100
gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat tgtgagcgga 8160
taacaatttc acacaggaaa cagctatgac catgattacg ccaagctcgg aattaaccct 8220
cactaaaggg aacaaaagct gctgcagggt ccctaactgc caagccccac agtgtgccct 8280
gaggctgccc cttccttcta gcggctgccc ccactcggct ttgctttccc tagtttcagt 8340
tacttgcgtt cagccaaggt ctgaaactag gtgcgcacag agcggtaaga ctgcgagaga 8400
aagagaccag ctttacaggg ggtttatcac agtgcaccct gacagtcgtc agcctcacag 8460
ggggtttatc acattgcacc ctgacagtcg tcagcctcac agggggttta tcacagtgca 8520
cccttacaat cattccattt gattcacaat ttttttagtc tctactgtgc ctaacttgta 8580
agttaaattt gatcagaggt gtgttcccag aggggaaaac agtatataca gggttcagta 8640
ctatcgcatt tcaggcctcc acctgggtct tggaatgtgt cccccgaggg gtgatgacta 8700
cctcagttgg atctccacag gtcacagtga cacaagataa ccaagacacc tcccaaggct 8760
accacaatgg gccgccctcc acgtgcacat ggccggagga actgccatgt cggaggtgca 8820
agcacacctg cgcatcagag tccttggtgt ggagggaggg accagcgcag cttccagcca 8880
tccacctgat gaacagaacc tagggaaagc cccagttcta cttacaccag gaaaggc 8937



<210> 9 
<211> 8937 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: DNA sequence
from transfer construct pmBCmCNluci



<400> 9 

tggaagggct aatttggtcc caaaaaagac aagagatcct tgatctgtgg atctaccaca 60
cacaaggcta cttccctgat tggcagaact acacaccagg gccagggatc agatatccac 120
tgacctttgg atggtgcttc aagttagtac cagttgaacc agagcaagta gaagaggcca 180
aataaggaga gaagaacagc ttgttacacc ctatgagcca gcatgggatg gaggacccgg 240
agggagaagt attagtgtgg aagtttgaca gcctcctagc atttcgtcac atggcccgag 300
agctgcatcc ggagtactac aaagactgct gacatcgagc tttctacaag ggactttccg 360
ctggggactt tccagggagg tgtggcctgg gcgggactgg ggagtggcga gccctcagat 420
gctacatata agcagctgct ttttgcctgt actgggtctc tctggttaga ccagatctga 480
gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct 540
tgagtgctca aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc 600
agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacttgaaag 660
cgaaagtaaa gccagaggag atctctcgac gcaggactcg gcttgctgaa gcgcgcacgg 720
caagaggcga ggggcggcgc ctgacgagga cgccaaaaat tttgactagc ggaggctaga 780
aggagagagc tcggtgcgag agcgtcagta ttaagcgggg gagaattaga tcgatgggaa 840
aaaattcggt taaggccagg gggaaagaag aagtacaagc taaagcacat cgtatgggca 900
agcagggagc tagaacgatt cgcagttaat cctggcctgt tagaaacatc agaaggctgt 960
agacaaatac tgggacagct acaaccatcc cttcagacag gatcagagga gcttcgatca 1020
ctatacaaca cagtagcaac cctctattgt gtgcaccagc ggatcgagat caaggacacc 1080
aaggaagctt tagacaagat agaggaagag caaaacaagt ccaagaagaa ggcccagcag 1140
gcagcagctg acacaggaca cagcaatcag gtcagccaaa attaccctat agtgcagaac 1200
atccaggggc aaatggtaca tcaggccata tcacctagaa ctttaaacga taagcttggg 1260
agttccgcgt tacataactt acggtaaatg gcccgcctgg ctgaccgccc aacgaccccc 1320
gcccattgac gtcaataatg acgtatgttc ccatagtaac gccaataggg actttccatt 1380
gacgtcaatg ggtggagtat ttacggtaaa ctgcccactt ggcagtacat caagtgtatc 1440
atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc tggcattatg 1500
cccagtacat gaccttatgg gactttccta cttggcagta catctacgta ttagtcatcg 1560
ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag cggtttgact 1620
cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt tggcaccaaa 1680
atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa atgggcggta 1740
ggcgtgtacg gtgggaggtc tatataagca gagctcgttt agtgaaccgt cagatcgcct 1800
ggagacgcca tccacgctgt tttgacctcc atagaagaca ccgactctag aggatccatc 1860
taagtaagct tggcattccg gtactgttgg taaaatggaa gacgccaaaa acataaagaa 1920
aggcccggcg ccattctatc ctctagagga tggaaccgct ggagagcaac tgcataaggc 1980
tatgaagaga tacgccctgg ttcctggaac aattgctttt acagatgcac atatcgaggt 2040
gaacatcacg tacgcggaat acttcgaaat gtccgttcgg ttggcagaag ctatgaaacg 2100
atatgggctg aatacaaatc acagaatcgt cgtatgcagt gaaaactctc ttcaattctt 2160
tatgccggtg ttgggcgcgt tatttatcgg agttgcagtt gcgcccgcga acgacattta 2220
taatgaacgt gaattgctca acagtatgaa catttcgcag cctaccgtag tgtttgtttc 2280
caaaaagggg ttgcaaaaaa ttttgaacgt gcaaaaaaaa ttaccaataa tccagaaaat 2340
tattatcatg gattctaaaa cggattacca gggatttcag tcgatgtaca cgttcgtcac 2400
atctcatcta cctcccggtt ttaatgaata cgattttgta ccagagtcct ttgatcgtga 2460
caaaacaatt gcactgataa tgaattcctc tggatctact gggttaccta agggtgtggc 2520
ccttccgcat agaactgcct gcgtcagatt ctcgcatgcc agagatccta tttttggcaa 2580
tcaaatcatt ccggatactg cgattttaag tgttgttcca ttccatcacg gttttggaat 2640
gtttactaca ctcggatatt tgatatgtgg atttcgagtc gtcttaatgt atagatttga 2700
agaagagctg tttttacgat cccttcagga ttacaaaatt caaagtgcgt tgctagtacc 2760
aaccctattt tcattcttcg ccaaaagcac tctgattgac aaatacgatt tatctaattt 2820
acacgaaatt gcttctgggg gcgcacctct ttcgaaagaa gtcggggaag cggttgcaaa 2880
acgcttccat cttccaggga tacgacaagg atatgggctc actgagacta catcagctat 2940
tctgattaca cccgaggggg atgataaacc gggcgcggtc ggtaaagttg ttccattttt 3000
tgaagcgaag gttgtggatc tggataccgg gaaaacgctg ggcgttaatc agagaggcga 3060
attatgtgtc agaggaccta tgattatgtc cggttatgta aacaatccgg aagcgaccaa 3120
cgccttgatt gacaaggatg gatggctaca ttctggagac atagcttact gggacgaaga 3180
cgaacacttc ttcatagttg accgcttgaa gtctttaatt aaatacaaag gatatcaggt 3240
ggcccccgct gaattggaat cgatattgtt acaacacccc aacatcttcg acgcgggcgt 3300
ggcaggtctt cccgacgatg acgccggtga acttcccgcc gccgttgttg ttttggagca 3360
cggaaagacg atgacggaaa aagagatcgt ggattacgtc gccagtcaag taacaaccgc 3420
gaaaaagttg cgcggaggag ttgtgtttgt ggacgaagta ccgaaaggtc ttaccggaaa 3480
actcgacgca agaaaaatca gagagatcct cataaaggcc aagaagggcg gaaagtccaa 3540
attgtaactc gagggggggc ccggtacctt taagaccaat gacttacaag gcagctgtag 3600
atcttagcca ctttttaaaa gaaaaggggg gactggaagg gctaattcac tcccaaagaa 3660
gacaagatat ccttgatctg tggatctacc acacacaagg ctacttccct gattggcaga 3720
actacacacc agggccaggg gtcagatatc cactgacctt tggatggtgc tacaagctag 3780
taccagttga gccagataag gtagaagagg ccaataaagg agagaacacc agcttgttac 3840
accctgtgag cctgcatgga atggatgacc ctgagagaga agtgttagag tggaggtttg 3900
acagccgcct agcatttcat cacgtggccc gagagctgca tccggagtac ttcaagaact 3960
gctgacatcg agcttgctac aagggacttt ccgctgggga ctttccaggg aggcgtggcc 4020
tgggcgggac tggggagtgg cgagccctca gatgctgcat ataagcagct gctttttgcc 4080
tgtactgggt ctctctggtt agaccagatc tgagcctggg agctctctgg ctaactaggg 4140
aacccactgc ttaagcctca ataaagcttg ccttgagtgc ttcaagtagt gtgtgcccgt 4200
ctgttgtgtg actctggtaa ctagagatcc ctcagaccct tttagtcagt gtggaaaatc 4260
tctagcaccc cccaggaggt agaggttgca gtgagccaag atcgcgccac tgcattccag 4320
cctgggcaag aaaacaagac tgtctaaaat aataataata agttaagggt attaaatata 4380
tttatacatg gaggtcataa aaatatatat atttgggctg ggcgcagtgg ctcacacctg 4440
cgcccggccc tttgggaggc cgaggcaggt ggatcacctg agtttgggag ttccagacca 4500
gcctgaccaa catggagaaa ccccttctct gtgtattttt agtagatttt attttatgtg 4560
tattttattc acaggtattt ctggaaaact gaaactgttt ttcctctact ctgataccac 4620
aagaatcatc agcacagagg aagacttctg tgatcaaatg tggtgggaga gggaggtttt 4680
caccagcaca tgagcagtca gttctgccgc agactcggcg ggtgtccttc ggttcagttc 4740
caacaccgcc tgcctggaga gaggtcagac cacagggtga gggctcagtc cccaagacat 4800
aaacacccaa gacataaaca cccaacaggt ccaccccgcc tgctgcccag gcagagccga 4860
ttcaccaaga cgggaattag gatagagaaa gagtaagtca cacagagccg gctgtgcggg 4920
agaacggagt tctattatga ctcaaatcag tctccccaag cattcgggga tcagagtttt 4980
taaggataac ttagtgtgta gggggccagt gagttggaga tgaaagcgta gggagtcgaa 5040
ggtgtccttt tgcgccgagt cagttcctgg gtgggggcca caagatcgga tgagccagtt 5100
tatcaatccg ggggtgccag ctgatccatg gagtgcaggg tctgcaaaat atctcaagca 5160
ctgattgatc ttaggtttta caatagtgat gttaccccag gaacaatttg gggaaggtca 5220
gaatcttgta gcctgtagct gcatgactcc taaaccataa tttctttttt gttttttttt 5280
ttttattttt gagacagggt ctcactctgt cacctaggct ggagtgcagt ggtgcaatca 5340
cagctcactg cagcccctag agcggccgcc accgcggtgg agctccaatt cgccctatag 5400
tgagtcgtat tacaattcac tggccgtcgt tttacaacgt cgtgactggg aaaaccctgg 5460
cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc gtaatagcga 5520
agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg aatggcgcga 5580
aattgtaaac gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa tcagctcatt 5640
ttttaaccaa taggccgaaa tcggcaaaat cccttataaa tcaaaagaat agaccgagat 5700
agggttgagt gttgttccag tttggaacaa gagtccacta ttaaagaacg tggactccaa 5760
cgtcaaaggg cgaaaaaccg tctatcaggg cgatggccca ctacgtgaac catcacccta 5820
atcaagtttt ttggggtcga ggtgccgtaa agcactaaat cggaacccta aagggagccc 5880
ccgatttaga gcttgacggg gaaagccggc gaacgtggcg agaaaggaag ggaagaaagc 5940
gaaaggagcg ggcgctaggg cgctggcaag tgtagcggtc acgctgcgcg taaccaccac 6000
acccgccgcg cttaatgcgc cgctacaggg cgcgtcccag gtggcacttt tcggggaaat 6060
gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta tccgctcatg 6120
agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa 6180
catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac 6240
ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac 6300
atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt 6360
ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg tattgacgcc 6420
gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca 6480
ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc 6540
ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag 6600
gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa 6660
ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg 6720
gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa 6780
ttaatagact ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg 6840
gctggctggt ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt 6900
gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt 6960
caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag 7020
cattggtaac tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat 7080
ttttaattta aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct 7140
taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct 7200
tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca 7260
gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc 7320
agcagagcgc agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc 7380
aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct 7440
gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag 7500
gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc 7560
tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg 7620
agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag 7680
cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt 7740
gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac 7800
gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg 7860
ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga taccgctcgc 7920
cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga gcgcccaata 7980
cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca cgacaggttt 8040
cccgactgga aagcgggcag tgagcgcaac gcaattaatg tgagttagct cactcattag 8100
gcaccccagg ctttacactt tatgcttccg gctcgtatgt tgtgtggaat tgtgagcgga 8160
taacaatttc acacaggaaa cagctatgac catgattacg ccaagctcgg aattaaccct 8220
cactaaaggg aacaaaagct gctgcagggt ccctaactgc caagccccac agtgtgccct 8280
gaggctgccc cttccttcta gcggctgccc ccactcggct ttgctttccc tagtttcagt 8340
tacttgcgtt cagccaaggt ctgaaactag gtgcgcacag agcggtaaga ctgcgagaga 8400
aagagaccag ctttacaggg ggtttatcac agtgcaccct gacagtcgtc agcctcacag 8460
ggggtttatc acattgcacc ctgacagtcg tcagcctcac agggggttta tcacagtgca 8520
cccttacaat cattccattt gattcacaat ttttttagtc tctactgtgc ctaacttgta 8580
agttaaattt gatcagaggt gtgttcccag aggggaaaac agtatataca gggttcagta 8640
ctatcgcatt tcaggcctcc acctgggtct tggaatgtgt cccccgaggg gtgatgacta 8700
cctcagttgg atctccacag gtcacagtga cacaagataa ccaagacacc tcccaaggct 8760
accacaatgg gccgccctcc acgtgcacat ggccggagga actgccatgt cggaggtgca 8820
agcacacctg cgcatcagag tccttggtgt ggagggaggg accagcgcag cttccagcca 8880
tccacctgat gaacagaacc tagggaaagc cccagttcta cttacaccag gaaaggc 8937



<210> 10 
<211> 122 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Description of Artificial Sequence: DNA sequence
of the BssHII to ClaI fragment in transfer
constructs pmBCwCNluci and pmBCmCNluci

<400> 10 

cgcgcacggc aagaggcgag gggcggcgcc tgacgaggac gccaaaaatt ttgactagcg 60
gaggctagaa ggagagagct cggtgcgaga gcgtcagtat taagcggggg agaattagat 120
cg 122



<210> 11 
<211> 122 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence
of the BssHII to ClaI fragment in transfer
construct 3

<400> 11 

cgcgcacggc aagaggcgag gggcggcgcc tggggaggac gccaaaaatt ttgactagcg 60 
gaggctagaa ggagagagat gggtgcgaga gcgtcagtat taagcggggg agaattagat 120
cg 122



<210> 12 
<211> 122 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<400> 12 

cgcgcacggc aagaggcgag gggcggcgac tggtgagtac gccaaaaatt ttgactatcg 60
gaggctagaa ggagagagat gggtgcgaga gcgtcagtat taagcggggg agaattagat 120
cg 122



<210> 13 
<211> 122
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Plurality
consensus sequence of DNA sequence of the BssHII
to ClaI fragment in HIV-1 and transfer constructs

<400> 13 

cgcgcacggc aagaggcgag gggcggcgac tggtgagtac gccaaaaatt ttgactagcg 60 
gaggctagaa ggagagagat gggtgcgaga gcgtcggtat taagcggggg agaattagat 120
aa 122 



<210> 14 
<211> 122 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence
of construct CMVkan/R-R-SIVgp160 CTE

<400> 14 

cgcgcacggc aagaggcgag gggcggcgac tggtgagtac gccaaaaatt ttgactagcg 60
gaggctagaa ggagagagat gggtgcgaga gcgtcagtat taagcggggg agaattagat 120
cg 122



<210> 15 
<211> 6978 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence
of construct CMVkan/R-R-SIVgp160 CTE

<400> 15 

cctggccatt gcatacgttg tatccatatc ataatatgta catttatatt ggctcatgtc 60
caacattacc gccatgttga cattgattat tgactagtta ttaatagtaa tcaattacgg 120
ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg gtaaatggcc 180
cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg tatgttccca 240
tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta cggtaaactg 300
cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt gacgtcaatg 360
acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac tttcctactt 420
ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt tggcagtaca 480
tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac cccattgacg 540
tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt cgtaacaact 600
ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat ataagcagag 660
ctcgtttagt gaaccgtcag atcgcctgga gacgccatcc acgctgtttt gacctccata 720
gaagacaccg ggaccgatcc agcctccgcg ggccgcgcta agtatgggat gtcttgggaa 780
tcagctgctt atcgccatct tgcttttaag tgtctatggg atctattgta ctctatatgt 840
cacagtcttt tatggtgtac cagcttggag gaatgcgaca attcccctct tttgtgcaac 900
caagaatagg gatacttggg gaacaactca gtgcctacca gataatggtg attattcaga 960
agtggccctt aatgttacag aaagctttga tgcctggaat aatacagtca cagaacaggc 1020
aatagaggat gtatggcaac tctttgagac ctcaataaag ccttgtgtaa aattatcccc 1080
attatgcatt actatgagat gcaataaaag tgagacagat agatggggat tgacaaaatc 1140
aataacaaca acagcatcaa caacatcaac gacagcatca gcaaaagtag acatggtcaa 1200
tgagactagt tcttgtatag cccaggataa ttgcacaggc ttggaacaag agcaaatgat 1260
aagctgtaaa ttcaacatga cagggttaaa aagagacaag aaaaaagagt acaatgaaac 1320
ttggtactct gcagatttgg tatgtgaaca agggaataac actggtaatg aaagtagatg 1380
ttacatgaac cactgtaaca cttctgttat ccaagagtct tgtgacaaac attattggga 1440
tgctattaga tttaggtatt gtgcacctcc aggttatgct ttgcttagat gtaatgacac 1500
aaattattca ggctttatgc ctaaatgttc taaggtggtg gtctcttcat gcacaaggat 1560
gatggagaca cagacttcta cttggtttgg ctttaatgga actagagcag aaaatagaac 1620
ttatatttac tggcatggta gggataatag gactataatt agtttaaata agtattataa 1680
tctaacaatg aaatgtagaa gaccaggaaa taagacagtt ttaccagtca ccattatgtc 1740
tggattggtt ttccactcac aaccaatcaa tgataggcca aagcaggcat ggtgttggtt 1800
tggaggaaaa tggaaggatg caataaaaga ggtgaagcag accattgtca aacatcccag 1860
gtatactgga actaacaata ctgataaaat caatttgacg gctcctggag gaggagatcc 1920
ggaagttacc ttcatgtgga caaattgcag aggagagttc ctctactgta aaatgaattg 1980
gtttctaaat tgggtagaag ataggaatac agctaaccag aagccaaagg aacagcataa 2040
aaggaattac gtgccatgtc atattagaca aataatcaac acttggcata aagtaggcaa 2100
aaatgtttat ttgcctccaa gagagggaga cctcacgtgt aactccacag tgaccagtct 2160
catagcaaac atagattgga ttgatggaaa ccaaactaat atcaccatga gtgcagaggt 2220
ggcagaactg tatcgattgg aattgggaga ttataaatta gtagagatca ctccaattgg 2280
cttggccccc acagatgtga agaggtacac tactggtggc acctcaagaa ataaaagagg 2340
ggtctttgtg ctagggttct tgggttttct cgcaacggca ggttctgcaa tgggagccgc 2400
cagcctgacc ctcacggcac agtcccgaac tttattggct gggatagtcc aacagcagca 2460
acagctgttg gacgtggtca agagacaaca agaattgttg cgactgaccg tctggggaac 2520
aaagaacctc cagactaggg tcactgccat cgagaagtac ttaaaggacc aggcgcagct 2580
gaatgcttgg ggatgtgcgt ttagacaagt ctgccacact actgtaccat ggccaaatgc 2640
aagtctaaca ccaaagtgga acaatgagac ttggcaagag tgggagcgaa aggttgactt 2700
cttggaagaa aatataacag ccctcctaga ggaggcacaa attcaacaag agaagaacat 2760
gtatgaatta caaaagttga atagctggga tgtgtttggc aattggtttg accttgcttc 2820
ttggataaag tatatacaat atggagttta tatagttgta ggagtaatac tgttaagaat 2880
agtgatctat atagtacaaa tgctagctaa gttaaggcag gggtataggc cagtgttctc 2940
ttccccaccc tcttatttcc agcagaccca tatccaacag gacccggcac tgccaaccag 3000
agaaggcaaa gaaagagacg gtggagaagg cggtggcaac agctcctggc cttggcagat 3060
agaatatatc cactttctta ttcgtcagct tattagactc ttgacttggc tattcagtaa 3120
ctgtaggact ttgctatcga gagtatacca gatcctccaa ccaatactcc agaggctctc 3180
tgcgacccta cagaggattc gagaagtcct caggactgaa ctgacctacc tacaatatgg 3240
gtggagctat ttccatgagg cggtccaggc cgtctggaga tctgcgacag agactcttgc 3300
gggcgcgtgg ggagacttat gggagactct taggagaggt ggaagatgga tactcgcaat 3360
ccccaggagg attagacaag ggcttgagct cactctcttg tgagggacag agaattcgga 3420
tccactagtt ctagactcga gggggggccc ggtacgagcg cttagctagc tagagaccac 3480
ctcccctgcg agctaagctg gacagccaat gacgggtaag agagtgacat ttttcactaa 3540
cctaagacag gagggccgtc agagctactg cctaatccaa agacgggtaa aagtgataaa 3600
aatgtatcac tccaacctaa gacaggcgca gcttccgagg gatttgtcgt ctgttttata 3660
tatatttaaa agggtgacct gtccggagcc gtgctgcccg gatgatgtct tggtctagac 3720
tcgagggggg gcccggtacg atccagatct gctgtgcctt ctagttgcca gccatctgtt 3780
gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc 3840
taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt 3900
ggggtggggc agcacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat 3960
gcggtgggct ctatgggtac ccaggtgctg aagaattgac ccggttcctc ctgggccaga 4020
aagaagcagg cacatcccct tctctgtgac acaccctgtc cacgcccctg gttcttagtt 4080
ccagccccac tcataggaca ctcatagctc aggagggctc cgccttcaat cccacccgct 4140
aaagtacttg gagcggtctc tccctccctc atcagcccac caaaccaaac ctagcctcca 4200
agagtgggaa gaaattaaag caagataggc tattaagtgc agagggagag aaaatgcctc 4260
caacatgtga ggaagtaatg agagaaatca tagaatttct tccgcttcct cgctcactga 4320
ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat 4380
acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca 4440
aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc 4500
tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata 4560
aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc 4620
gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcaatgctc 4680
acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga 4740
accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc 4800
ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag 4860
gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag 4920
gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag 4980
ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 5040
gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 5100
cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat 5160
cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga 5220
gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg 5280
tctatttcgt tcatccatag ttgcctgact ccgggggggg ggggcgctga ggtctgcctc 5340
gtgaagaagg tgttgctgac tcataccagg cctgaatcgc cccatcatcc agccagaaag 5400
tgagggagcc acggttgatg agagctttgt tgtaggtgga ccagttggtg attttgaact 5460
tttgctttgc cacggaacgg tctgcgttgt cgggaagatg cgtgatctga tccttcaact 5520
cagcaaaagt tcgatttatt caacaaagcc gccgtcccgt caagtcagcg taatgctctg 5580
ccagtgttac aaccaattaa ccaattctga ttagaaaaac tcatcgagca tcaaatgaaa 5640
ctgcaattta ttcatatcag gattatcaat accatatttt tgaaaaagcc gtttctgtaa 5700
tgaaggagaa aactcaccga ggcagttcca taggatggca agatcctggt atcggtctgc 5760
gattccgact cgtccaacat caatacaacc tattaatttc ccctcgtcaa aaataaggtt 5820
atcaagtgag aaatcaccat gagtgacgac tgaatccggt gagaatggca aaagcttatg 5880
catttctttc cagacttgtt caacaggcca gccattacgc tcgtcatcaa aatcactcgc 5940
atcaaccaaa ccgttattca ttcgtgattg cgcctgagcg agacgaaata cgcgatcgct 6000
gttaaaagga caattacaaa caggaatcga atgcaaccgg cgcaggaaca ctgccagcgc 6060
atcaacaata ttttcacctg aatcaggata ttcttctaat acctggaatg ctgttttccc 6120
ggggatcgca gtggtgagta accatgcatc atcaggagta cggataaaat gcttgatggt 6180
cggaagaggc ataaattccg tcagccagtt tagtctgacc atctcatctg taacatcatt 6240
ggcaacgcta cctttgccat gtttcagaaa caactctggc gcatcgggct tcccatacaa 6300
tcgatagatt gtcgcacctg attgcccgac attatcgcga gcccatttat acccatataa 6360
atcagcatcc atgttggaat ttaatcgcgg cctcgagcaa gacgtttccc gttgaatatg 6420
gctcataaca ccccttgtat tactgtttat gtaagcagac agttttattg ttcatgatga 6480
tatattttta tcttgtgcaa tgtaacatca gagattttga gacacaacgt ggctttcccc 6540
ccccccccat tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga 6600
atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc 6660
tgacgtctaa gaaaccatta ttatcatgac attaacctat aaaaataggc gtatcacgag 6720
gccctttcgt ctcgcgcgtt tcggtgatga cggtgaaaac ctctgacaca tgcagctccc 6780
ggagacggtc acagcttgtc tgtaagcgga tgccgggagc agacaagccc gtcagggcgc 6840
gtcagcgggt gttggcgggt gtcggggctg gcttaactat gcggcatcag agcagattgt 6900
actgagagtg caccatatgc ggtgtgaaat accgcacaga tgcgtaagga gaaaataccg 6960
catcagattg gctattgg 6978



<210> 16 
<211> 879 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: SIV gp160 env
in plasmid CMVkan/R-R-SIVgp160 CTE

<400> 16 

Met Gly Cys Leu Gly Asn Gln Leu Leu Ile Ala Ile Leu Leu Leu Ser
1 5 10 15

Val Tyr Gly Ile Tyr Cys Thr Leu Tyr Val Thr Val Phe Tyr Gly Val
20 25 30

Pro Ala Trp Arg Asn Ala Thr Ile Pro Leu Phe Cys Ala Thr Lys Asn
35 40 45

Arg Asp Thr Trp Gly Thr Thr Gln Cys Leu Pro Asp Asn Gly Asp Tyr
50 55 60

Ser Glu Val Ala Leu Asn Val Thr Glu Ser Phe Asp Ala Trp Asn Asn
65 70 75 80

Thr Val Thr Glu Gln Ala Ile Glu Asp Val Trp Gln Leu Phe Glu Thr
85 90 95

Ser Ile Lys Pro Cys Val Lys Leu Ser Pro Leu Cys Ile Thr Met Arg
100 105 110

Cys Asn Lys Ser Glu Thr Asp Arg Trp Gly Leu Thr Lys Ser Ile Thr
115 120 125

Thr Thr Ala Ser Thr Thr Ser Thr Thr Ala Ser Ala Lys Val Asp Met
130 135 140

Val Asn Glu Thr Ser Ser Cys Ile Ala Gln Asp Asn Cys Thr Gly Leu
145 150 155 160

Glu Gln Glu Gln Met Ile Ser Cys Lys Phe Asn Met Thr Gly Leu Lys
165 170 175

Arg Asp Lys Lys Lys Glu Tyr Asn Glu Thr Trp Tyr Ser Ala Asp Leu
180 185 190

Val Cys Glu Gln Gly Asn Asn Thr Gly Asn Glu Ser Arg Cys Tyr Met
195 200 205

Asn His Cys Asn Thr Ser Val Ile Gln Glu Ser Cys Asp Lys His Tyr
210 215 220

Trp Asp Ala Ile Arg Phe Arg Tyr Cys Ala Pro Pro Gly Tyr Ala Leu
225 230 235 240

Leu Arg Cys Asn Asp Thr Asn Tyr Ser Gly Phe Met Pro Lys Cys Ser
245 250 255

Lys Val Val Val Ser Ser Cys Thr Arg Met Met Glu Thr Gln Thr Ser
260 265 270

Thr Trp Phe Gly Phe Asn Gly Thr Arg Ala Glu Asn Arg Thr Tyr Ile
275 280 285

Tyr Trp His Gly Arg Asp Asn Arg Thr Ile Ile Ser Leu Asn Lys Tyr
290 295 300

Tyr Asn Leu Thr Met Lys Cys Arg Arg Pro Gly Asn Lys Thr Val Leu
305 310 315 320

Pro Val Thr Ile Met Ser Gly Leu Val Phe His Ser Gln Pro Ile Asn
325 330 335

Asp Arg Pro Lys Gln Ala Trp Cys Trp Phe Gly Gly Lys Trp Lys Asp
340 345 350

Ala Ile Lys Glu Val Lys Gln Thr Ile Val Lys His Pro Arg Tyr Thr
355 360 365

Gly Thr Asn Asn Thr Asp Lys Ile Asn Leu Thr Ala Pro Gly Gly Gly
370 375 380
Asp Pro Glu Val Thr Phe Met Trp Thr Asn Cys Arg Gly Glu Phe Leu
385 390 395 400

Tyr Cys Lys Met Asn Trp Phe Leu Asn Trp Val Glu Asp Arg Asn Thr
405 410 415

Ala Asn Gln Lys Pro Lys Glu Gln His Lys Arg Asn Tyr Val Pro Cys
420 425 430

His Ile Arg Gln Ile Ile Asn Thr Trp His Lys Val Gly Lys Asn Val
435 440 445

Tyr Leu Pro Pro Arg Glu Gly Asp Leu Thr Cys Asn Ser Thr Val Thr
450 455 460

Ser Leu Ile Ala Asn Ile Asp Trp Ile Asp Gly Asn Gln Thr Asn Ile
465 470 475 480

Thr Met Ser Ala Glu Val Ala Glu Leu Tyr Arg Leu Glu Leu Gly Asp
485 490 495

Tyr Lys Leu Val Glu Ile Thr Pro Ile Gly Leu Ala Pro Thr Asp Val
500 505 510

Lys Arg Tyr Thr Thr Gly Gly Thr Ser Arg Asn Lys Arg Gly Val Phe
515 520 525

Val Leu Gly Phe Leu Gly Phe Leu Ala Thr Ala Gly Ser Ala Met Gly
530 535 540

Ala Ala Ser Leu Thr Leu Thr Ala Gln Ser Arg Thr Leu Leu Ala Gly
545 550 555 560

Ile Val Gln Gln Gln Gln Gln Leu Leu Asp Val Val Lys Arg Gln Gln
565 570 575

Glu Leu Leu Arg Leu Thr Val Trp Gly Thr Lys Asn Leu Gln Thr Arg
580 585 590

Val Thr Ala Ile Glu Lys Tyr Leu Lys Asp Gln Ala Gln Leu Asn Ala
595 600 605

Trp Gly Cys Ala Phe Arg Gln Val Cys His Thr Thr Val Pro Trp Pro
610 615 620

Asn Ala Ser Leu Thr Pro Lys Trp Asn Asn Glu Thr Trp Gln Glu Trp
625 630 635 640

Glu Arg Lys Val Asp Phe Leu Glu Glu Asn Ile Thr Ala Leu Leu Glu
645 650 655

Glu Ala Gln Ile Gln Gln Glu Lys Asn Met Tyr Glu Leu Gln Lys Leu
660 665 670

Asn Ser Trp Asp Val Phe Gly Asn Trp Phe Asp Leu Ala Ser Trp Ile
675 680 685

Lys Tyr Ile Gln Tyr Gly Val Tyr Ile Val Val Gly Val Ile Leu Leu
690 695 700

Arg Ile Val Ile Tyr Ile Val Gln Met Leu Ala Lys Leu Arg Gln Gly
705 710 715 720

Tyr Arg Pro Val Phe Ser Ser Pro Pro Ser Tyr Phe Gln Gln Thr His
725 730 735

Ile Gln Gln Asp Pro Ala Leu Pro Thr Arg Glu Gly Lys Glu Arg Asp
740 745 750

Gly Gly Glu Gly Gly Gly Asn Ser Ser Trp Pro Trp Gln Ile Glu Tyr
755 760 765

Ile His Phe Leu Ile Arg Gln Leu Ile Arg Leu Leu Thr Trp Leu Phe
770 775 780

Ser Asn Cys Arg Thr Leu Leu Ser Arg Val Tyr Gln Ile Leu Gln Pro
785 790 795 800

Ile Leu Gln Arg Leu Ser Ala Thr Leu Gln Arg Ile Arg Glu Val Leu
805 810 815

Arg Thr Glu Leu Thr Tyr Leu Gln Tyr Gly Trp Ser Tyr Phe His Glu
820 825 830

Ala Val Gln Ala Val Trp Arg Ser Ala Thr Glu Thr Leu Ala Gly Ala
835 840 845

Trp Gly Asp Leu Trp Glu Thr Leu Arg Arg Gly Gly Arg Trp Ile Leu
850 855 860

Ala Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu Leu Thr Leu Leu
865 870 875



<210> 17
<211> 271
<212> PRT 

<213> Escherichia coli 



<400> 17 
Met Ser His Ile Gln Arg Glu Thr Ser Cys Ser Arg Pro Arg Leu Asn
1 5 10 15

Ser Asn Met Asp Ala Asp Leu Tyr Gly Tyr Lys Trp Ala Arg Asp Asn
20 25 30

Val Gly Gln Ser Gly Ala Thr Ile Tyr Arg Leu Tyr Gly Lys Pro Asp
35 40 45

Ala Pro Glu Leu Phe Leu Lys His Gly Lys Gly Ser Val Ala Asn Asp
50 55 60

Val Thr Asp Glu Met Val Arg Leu Asn Trp Leu Thr Glu Phe Met Pro
65 70 75 80

Leu Pro Thr Ile Lys His Phe Ile Arg Thr Pro Asp Asp Ala Trp Leu
85 90 95

Leu Thr Thr Ala Ile Pro Gly Lys Thr Ala Phe Gln Val Leu Glu Glu
100 105 110

Tyr Pro Asp Ser Gly Glu Asn Ile Val Asp Ala Leu Ala Val Phe Leu
115 120 125

Arg Arg Leu His Ser Ile Pro Val Cys Asn Cys Pro Phe Asn Ser Asp
130 135 140

Arg Val Phe Arg Leu Ala Gln Ala Gln Ser Arg Met Asn Asn Gly Leu
145 150 155 160

Val Asp Ala Ser Asp Phe Asp Asp Glu Arg Asn Gly Trp Pro Val Glu
165 170 175

Gln Val Trp Lys Glu Met His Lys Leu Leu Pro Phe Ser Pro Asp Ser
180 185 190

Val Val Thr His Gly Asp Phe Ser Leu Asp Asn Leu Ile Phe Asp Glu
195 200 205

Gly Lys Leu Ile Gly Cys Ile Asp Val Gly Arg Val Gly Ile Ala Asp
210 215 220

Arg Tyr Gln Asp Leu Ala Ile Leu Trp Asn Cys Leu Gly Glu Phe Ser
225 230 235 240

Pro Ser Leu Gln Lys Arg Leu Phe Gln Lys Tyr Gly Ile Asp Asn Pro
245 250 255

Asp Met Asn Lys Leu Gln Phe His Leu Met Leu Asp Glu Phe Phe
260 265 270



<210> 18 
<211> 2640 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: DNA sequence 
of mutated SIV gene in construct 
CMVkan/R-R-SIVgp160 CTE



<400> 18 

atgggatgtc ttgggaatca gctgcttatc gccatcttgc ttttaagtgt ctatgggatc 60
tattgtactc tatatgtcac agtcttttat ggtgtaccag cttggaggaa tgcgacaatt 120
cccctctttt gtgcaaccaa gaatagggat acttggggaa caactcagtg cctaccagat 180
aatggtgatt attcagaagt ggcccttaat gttacagaaa gctttgatgc ctggaataat 240
acagtcacag aacaggcaat agaggatgta tggcaactct ttgagacctc aataaagcct 300
tgtgtaaaat tatccccatt atgcattact atgagatgca ataaaagtga gacagataga 360
tggggattga caaaatcaat aacaacaaca gcatcaacaa catcaacgac agcatcagca 420
aaagtagaca tggtcaatga gactagttct tgtatagccc aggataattg cacaggcttg 480
gaacaagagc aaatgataag ctgtaaattc aacatgacag ggttaaaaag agacaagaaa 540
aaagagtaca atgaaacttg gtactctgca gatttggtat gtgaacaagg gaataacact 600
ggtaatgaaa gtagatgtta catgaaccac tgtaacactt ctgttatcca agagtcttgt 660
gacaaacatt attgggatgc tattagattt aggtattgtg cacctccagg ttatgctttg 720
cttagatgta atgacacaaa ttattcaggc tttatgccta aatgttctaa ggtggtggtc 780
tcttcatgca caaggatgat ggagacacag acttctactt ggtttggctt taatggaact 840
agagcagaaa atagaactta tatttactgg catggtaggg ataataggac tataattagt 900
ttaaataagt attataatct aacaatgaaa tgtagaagac caggaaataa gacagtttta 960
ccagtcacca ttatgtctgg attggttttc cactcacaac caatcaatga taggccaaag 1020
caggcatggt gttggtttgg aggaaaatgg aaggatgcaa taaaagaggt gaagcagacc 1080
attgtcaaac atcccaggta tactggaact aacaatactg ataaaatcaa tttgacggct 1140
cctggaggag gagatccgga agttaccttc atgtggacaa attgcagagg agagttcctc 1200
tactgtaaaa tgaattggtt tctaaattgg gtagaagata ggaatacagc taaccagaag 1260
ccaaaggaac agcataaaag gaattacgtg ccatgtcata ttagacaaat aatcaacact 1320
tggcataaag taggcaaaaa tgtttatttg cctccaagag agggagacct cacgtgtaac 1380
tccacagtga ccagtctcat agcaaacata gattggattg atggaaacca aactaatatc 1440
accatgagtg cagaggtggc agaactgtat cgattggaat tgggagatta taaattagta 1500
gagatcactc caattggctt ggcccccaca gatgtgaaga ggtacactac tggtggcacc 1560
tcaagaaata aaagaggggt ctttgtgcta gggttcttgg gttttctcgc aacggcaggt 1620
tctgcaatgg gagccgccag cctgaccctc acggcacagt cccgaacttt attggctggg 1680
atagtccaac agcagcaaca gctgttggac gtggtcaaga gacaacaaga attgttgcga 1740
ctgaccgtct ggggaacaaa gaacctccag actagggtca ctgccatcga gaagtactta 1800
aaggaccagg cgcagctgaa tgcttgggga tgtgcgttta gacaagtctg ccacactact 1860
gtaccatggc caaatgcaag tctaacacca aagtggaaca atgagacttg gcaagagtgg 1920
gagcgaaagg ttgacttctt ggaagaaaat ataacagccc tcctagagga ggcacaaatt 1980
caacaagaga agaacatgta tgaattacaa aagttgaata gctgggatgt gtttggcaat 2040
tggtttgacc ttgcttcttg gataaagtat atacaatatg gagtttatat agttgtagga 2100
gtaatactgt taagaatagt gatctatata gtacaaatgc tagctaagtt aaggcagggg 2160
tataggccag tgttctcttc cccaccctct tatttccagc agacccatat ccaacaggac 2220
ccggcactgc caaccagaga aggcaaagaa agagacggtg gagaaggcgg tggcaacagc 2280
tcctggcctt ggcagataga atatatccac tttcttattc gtcagcttat tagactcttg 2340
acttggctat tcagtaactg taggactttg ctatcgagag tataccagat cctccaacca 2400
atactccaga ggctctctgc gaccctacag aggattcgag aagtcctcag gactgaactg 2460
acctacctac aatatgggtg gagctatttc catgaggcgg tccaggccgt ctggagatct 2520
gcgacagaga ctcttgcggg cgcgtgggga gacttatggg agactcttag gagaggtgga 2580
agatggatac tcgcaatccc caggaggatt agacaagggc ttgagctcac tctcttgtga 2640



<210> 19 
<211> 813 
<212> DNA 

<213> Escherichia coli
<400> 19 

atgagccata ttcaacggga aacgtcttgc tcgaggccgc gattaaattc caacatggat 60
gctgatttat atgggtataa atgggctcgc gataatgtcg ggcaatcagg tgcgacaatc 120
tatcgattgt atgggaagcc cgatgcgcca gagttgtttc tgaaacatgg caaaggtagc 180
gttgccaatg atgttacaga tgagatggtc agactaaact ggctgacgga atttatgcct 240
cttccgacca tcaagcattt tatccgtact cctgatgatg catggttact caccactgcg 300
atccccggga aaacagcatt ccaggtatta gaagaatatc ctgattcagg tgaaaatatt 360
gttgatgcgc tggcagtgtt cctgcgccgg ttgcattcga ttcctgtttg taattgtcct 420
tttaacagcg atcgcgtatt tcgtctcgct caggcgcaat cacgaatgaa taacggtttg 480
gttgatgcga gtgattttga tgacgagcgt aatggctggc ctgttgaaca agtctggaaa 540
gaaatgcata agcttttgcc attctcaccg gattcagtcg tcactcatgg tgatttctca 600
cttgataacc ttatttttga cgaggggaaa ttaataggtt gtattgatgt tggacgagtc 660
ggaatcgcag accgatacca ggatcttgcc atcctatgga actgcctcgg tgagttttct 720
ccttcattac agaaacggct ttttcaaaaa tatggtattg ataatcctga tatgaataaa 780
ttgcagtttc atttgatgct cgatgagttt ttc 813